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Nucleotide Sequence of the Haemophilus influenzae Rd 
Genome, Fragments Thereof, and Uses Thereof 



Part of the work performed during development of this invention 
utilized U.S. Government funds. The government may have certain rights in 
5 this invention. NIH-5R01GM48251 



Field of the Invention 

The present invention relates to the field of molecular biology. The 
present invention discloses compositions comprising the nucleotide sequence 
of Haemophilus influenzae, fragments thereof and usage in industrial 
10 fermentation and pharmaceutical development. 

Background of the Invention 

The complete genome sequence from a free living cellular organism has 
never been determined. The first mycobacterium sequence should be 
completed by 1996, while E. coli and 5. cerevisae are expected to be 
completed before 1998. These are being done by random and/or directed 
sequencing of overlapping cosmid clones. No one has attempted to determine 
sequences of the order of a megabase or more by a random shotgun approach. 

H. influenzae is a small (approximately 0.4 x 1 micron) non-motile, 
non-spore forming, germ-negative bacterium whose only natural host is 
human. It is a resident of the upper respiratory mucosa of children and adults 
and causes otitis media and respiratory tract infections mostly in children. The 
most serious complication is meningitis, which produces neurological sequelae 
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in up to 50% of affected children. Six H. influenzae serotypes (a through 0 
have been identified based on immunologically distinct capsular polysaccharide 
antigens. A number of non-typeable strains are also known. Serotype b 
accounts for the majority of human disease. 

5 Interest in the medically important aspects of H. influenzae biology has 

focused particularly on those genes which determine virulence characteristics 
of the organism. A number of the genes responsible for the capsular 
polysaccharide have been mapped and sequenced (Kroll et al. , Mol. Microbiol. 
5(6): 1549-1560 (1991)). Several outer membrane protein (OMP) genes have 

10 been identified and sequenced (Langford et al. , 7. Gen. Microbiol. 138: 155- 

159 (1992)). The lipoligosaccharide (LOS) component of the outer membrane 
and the genes of its synthetic pathway are under intensive study (Weiser et al. , 
7. Bacteriol. 772:3304-3309 (1990)). While a vaccine has been available since 
1984, the study of outer membrane components is motivated to some extent 

15 by the need for improved vaccines. Recently, the catalase gene was 

characterized and sequenced as a possible virulence-related gene (Bishni et al. , 
in press). Elucidation of the H. influenzae genome will enhance the 
understanding of how H. influenzae causes invasive disease and how best to 
combat infection. 

20 H. influenzae possesses a highly efficient natural DNA transformation 

system which has been intensively studied in the non-encapsulated (R), 
serotype d strain (Kahn and Smith, 7. Membrane Biology 87:89-103 (1984)). 
At least 16 transformation-specific genes have been identified and sequenced. 
Of these, four are regulatory (Redfield, 7. Bacteriol. 775:5612-5618 (1991), 

25 and Chandler, Proc. Natl. Acad. Sci. USA 89: 1626-1630 (1992)), at least two 

are involved in recombination processes (Barouki and Smith, 7. Bacteriol. 
i<53(2):629-634 (1985)), and at least seven are targeted to the membranes and 
periplasmic space (Tomb et al. , Gene 104: 1-10 (1991), and Tomb, Proc. Natl. 
Acad. Sci. USA 89:10252-10256 (1992)), where they appear to function as 

30 structural components or in the assembly of the DNA transport machinery. H. 

influenzae Rd transformation shows a number of interesting features including 
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sequence-specific DNA uptake, rapid uptake of several double- stranded DNA 
molecules per competent cell into a membrane compartment called the 
transformasome, linear translocation of a single strand of the donor DNA into 
the cytoplasm, and synapsis and recombination of the strand with the 
5 chromosome by a single- strand displacement mechanism. The H. influenzae 

Rd transformation system is the most thoroughly studied of the gram-negative 
systems and distinct in a number of ways from the gram-positive systems. 

The size of H. influenzae Rd genome has been determined by 
pulsed-field agarose gel electrophoresis of restriction digests to be 

10 approximately 1.9 Mb, making its genome approximately 40% the size of E. 

coli (Lee and Smith, J. Bacterid 770:4402-4405 (1988)). The restriction 
map of H. influenzae is circular (Lee et al. f J. BacterioL 777:3016-3024 
(1989), and Redfield and Lee, "Haemophilus influenzae Rd", pp. 2110-2112, 
In O'Brien, S.J. (ed), Genetic Maps: Locus Maps of Complex Genomes, 

15 Cold Spring Harbor Press, New York). Various genes have been mapped to 

restriction fragments by Southern hybridization probing of restriction digest 
DNA bands. This map will be valuable in verification of the assembly of a 
complete genome sequence from randomly sequenced fragments. GenBank 
currently contains about 100 kb of non-redundant H. influenzae DNA 

20 sequences. About half are from serotype b and half from Rd. 

Summary of the Invention 

The present invention is based on the sequencing of the Haemophilus 
influenzae Rd genome. The primary nucleotide sequence which was generated 
is provided in SEQ ID NO: 1 . 
25 The present invention provides the generated nucleotide sequence of the 

Haemophilus influenzae Rd genome, or a representative fragment thereof, in 
a form which can be readily used, analyzed, and interpreted by a skilled 
artisan. In one embodiment, present invention is provided as a contiguous 
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string of primary sequence information corresponding to the nucleotide 
sequence depicted in SEQ ID NO:l. 

The present invention further provides nucleotide sequences which are 
at least 99.9% identical to the nucleotide sequence of SEQ ED NO: 1 . 

5 The nucleotide sequence of SEQ ID NO: 1 , a representative fragment 

thereof, or a nucleotide sequence which is at least 99.9% identical to the 
nucleotide sequence of SEQ ID NO:l may be provided in a variety of 
mediums to facilitate its use. In one application of this embodiment, the 
sequences of the present invention are recorded on computer readable media. 

10 Such media includes, but is not limited to: magnetic storage media, such as 

floppy discs, hard disc storage medium, and magnetic tape; optical storage 
media such as CD-ROM; electrical storage media such as RAM and ROM; 
and hybrids of these categories such as magnetic/optical storage media. 

The present invention further provides systems, particularly computer- 

15 based systems which contain the sequence information herein described stored 

in a data storage means. Such systems are designed to identify commercially 
important fragments of the Haemophilus influenzae Rd genome. 

Another embodiment of the present invention is directed to isolated 
fragments of the Haemophilus influenzae Rd genome. The fragments of the 

20 Haemophilus influenzae Rd genome of the present invention include, but are 

not limited to, fragments which encode peptides, hereinafter open reading 
frames (ORFs), fragments which modulate the expression of an operably 
linked ORF, hereinafter expression modulating fragments (EMFs), fragments 
which mediate the uptake of a linked DNA fragment into a cell, hereinafter 

25 uptake modulating fragments (UMFs), and fragments which can be used to 

diagnose the presence of Haemophilus influenzae Rd in a sample, hereinafter, 
diagnostic fragments (DFs). 

Each of the ORF fragments of the Haemophilus influenzae Rd genome 
disclosed in Tables 1(a) and 2, and the EMF found 5' to the ORF, can be used 

30 in numerous ways as polynucleotide reagents. The sequences can be used as 

diagnostic probes or diagnostic amplification primers for the presence of a 
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specific microbe in a sample, for the production of commercially important 
pharmaceutical agents, and to selectively control gene expression. 

The present invention further includes recombinant constructs 
comprising one or more fragments of the Haemophilus influenzae Rd genome 
5 of the present invention. The recombinant constructs of the present invention 

comprise vectors, such as a plasmid or viral vector, into which a fragment of 
the Haemophilus influenzae Rd has been inserted. 

The present invention further provides host cells containing any one of 
the isolated fragments of the Haemophilus influenzae Rd genome of the present 
10 invention. The host cells can be a higher eukaryotic host such as a mammalian 

cell, a lower eukaryotic cell such as a yeast cell, or can be a procaryotic cell 
such as a bacterial cell. 

The present invention is further directed to isolated proteins encoded 
by the ORFs of the present invention. A variety of methodologies known in 
15 the art can be utilized to obtain any one of the proteins of the present 

invention. At the simplest level, the amino acid sequence can be synthesized 
using commercially available peptide synthesizers. In an alternative method, 
the protein is purified from bacterial cells which naturally produce the protein. 
Lastly, the proteins of the present invention can alternatively be purified from 
20 cells which have been altered to express the desired protein. 

The invention further provides methods of obtaining homologs of the 
fragments of the Haemophilus influenzae Rd genome of the present invention 
and homologs of the proteins encoded by the ORFs of the present invention. 
Specifically, by using the nucleotide and amino acid sequences disclosed herein 
25 as a probe or as primers, and techniques such as PCR cloning and 

colony/plaque hybridization, one skilled in the art can obtain homologs. 

The invention further provides antibodies which selectively bind one 
of the proteins of the present invention. Such antibodies include both 
monoclonal and polyclonal antibodies. 



-5- 

SUBSTTTUTE SHEET (RULE 26) 



BMSDOCIDt <WO 9633276A1 I > 



WO 96/33276 



PCT/US96/05320 



The invention further provides hybridomas which produce the above- 
described antibodies. A hybridoma is an immortalized cell line which is 
capable of secreting a specific monoclonal antibody. 

The present invention further provides methods of identifying test 
5 samples derived from cells which express one of the ORF of the present 

invention, or homolog thereof. Such methods comprise incubating a test 
sample with one or more of the antibodies of the present invention, or one or 
more of the DFs of the present invention, under conditions which allow a 
skilled artisan to determine if the sample contains the ORF or product 
10 produced therefrom. 

In another embodiment of the present invention, kits are provided 
which contain the necessary reagents to carry out the above-described assays. 

Specifically, the invention provides a compartmentalized kit to receive, 
in close confinement, one or more containers which comprises: (a) a first 
15 container comprising one of the antibodies, or one of the DFs of the present 

invention; and (b) one or more other containers comprising one or more of the 
following: wash reagents, reagents capable of detecting presence of bound 
antibodies or hybridized DFs. 

Using the isolated proteins of the present invention, the present 
20 invention further provides methods of obtaining and identifying agents capable 

of binding to a protein encoded by one of the ORFs of the present invention. 
Specifically, such agents include antibodies (described above), peptides, 
carbohydrates, pharmaceutical agents and the like. Such methods comprise the 
steps of: 

25 (a) contacting an agent with an isolated protein encoded by one of 

the ORFs of the present invention; and 
(b) determining whether the agent binds to said protein. 
The complete genomic sequence of H. influenzae will be of great value 
to all laboratories working with this organism and for a variety of commercial 
30 purposes. Many fragments of the Haemophilus influenzae Rd genome will be 

immediately identified by similarity searches against GenBank or protein 
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databases and will be of immediate value to Haemophilus researchers and for 
immediate commercial value for the production of proteins or to control gene 
expression. A specific example concerns PHA synthase. It has been reported 
that polyhydroxybutyrate is present in the membranes of //. influenzae Rd and 
5 that the amount correlates with the level of competence for transformation. 

The PHA synthase that synthesizes this polymer has been identified and 
sequenced in a number of bacteria, none of which are evolutionarily close to 
H. influenzae. This gene has yet to be isolated from H. influenzae by use of 
hybridization probes or PCR techniques. However, the genomic sequence of 

10 the present invention allows the identification of the gene by utilizing search 

means described below. 

Developing the methodology and technology for elucidating the entire 
genomic sequence of bacterial and other small genomes has and will greatly 
enhance the ability to analyze and understand chromosomal organization. In 

15 particular, sequenced genomes will provide the models for developing tools for 

the analysis of chromosome structure and function, including the ability to 
identify genes within large segments of genomic DNA, the structure, position, 
and spacing of regulatory elements, the identification of genes with potential 
industrial applications, and the ability to do comparative genomic and 

20 molecular phylogeny. 



Description of the Figures 



Figure 1 - restriction map of the Haemophilus influenzae Rd genome. 

Figure 2 - Block diagram of a computer system 102 that can be used 
to implement the computer-based systems of present invention. 
25 Figure 3 - A comparison of experimental coverage of up to 

approximately 4000 random sequence fragments assembled with 
AutoAssembler (squares) as compared to Lander- Waterman prediction for a 
2.5 Mb genome (triangles) and a 1.6 Mb genome (circles) with a 460 bp 
average sequence length and a 25 bp overlap. 
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Figure 4 - Data How and computer programs used to manage, 
assemble, edit, and annotate the H. influenzae genome. Both Macintosh and 
Unix platforms are used to handle the AB 373 sequence data files (Kerlavage 
et al. Proceedings of the Twenty-Sixth Annual Hawaii International 
5 Conference on System Sciences, IEEE Computer Society Press, Washington 

D.C., 585 (1993)). Factum (AB) is a Macintosh program designed for 
automatic vector sequence removal and end trimming of sequence files. The 
program esp runs on a Macintosh platform and parses the feature data 
extracted from the sequence files by Factum to the Unix based H. influenzae 
10 relational database. Assembly is accomplished by retrieving a specific set of 

sequence files and their associated features using stp, an X-windows graphical 
interface and control program which can retrieve sequences from the H. 
influenzae database using user-defined or standard SQL queries. The sequence 
files were assembled using TIGR Assembler, an assembly engine designed at 
15 TIGR for rapid and accurate assembly of thousands of sequence fragments. 

TIGR Editor is a graphical interface which can parse the aligned sequence 
files from TIGR Assembler output and display the alignment and associated 
electropherograms for contig editing. Identification of putative coding regions 
was performed with Genemark (Borodovsky and Mclninch, Computers Chem. 
20 1 7(2): 123 (1993)), a Markov and Bayes modeled program for predicting gene 

locations, and trained on a H. influenzae sequence data set. Peptide searches 
were performed against the three reading frames of each Genemark predicted 
coding region using blaze (Brutlag et al., Computers Chem. 77:203 (1993)) 
run on a Maspar MP-2 massively parallel computer with 4096 
25 microprocessors. Results from each frame were combined into a single output 

file by mblzt. Optimal protein alignments were obtained using the program 
praze which extends alignments across potential frameshifts. The output was 
inspected using a custom graphic viewing program, gbyob, that interacts 
directly with the H. influenzae database. The alignments were further used to 
30 identify potential frameshift errors and were targeted for additional editing. 
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Figure 5 - A circular representation of the H. influenzae Rd 
chromosome illustrating the location of each predicted coding region contain- 
ing a database match as well as selected global features of the genome. Outer 
perimeter: The location of the unique Notl restriction site (designated as 
5 nucleotide 1), the RsrH sites, and the Smal sites. Outer concentric circle: The 

location of each identified coding region for which a gene identification was 
made. Each coding region location is coded as to role according to the color 
code in Fig. 6. Second concentric circle: Regions of high G/C content ( > 
42%, red; > 40%, blue) and high A/T content (> 66%, black; > 64%, 

10 green). High G/C content regions are specifically associated with the 6 

ribosomal operons and the mu-like prophage. Third concentric circle: 
Coverage by lambda clones (blue). Over 300 lambda clones were sequenced 
from each end to confirm the overall structure of the genome and identify the 
6 ribosomal operons. Fourth concentric circle: The locations of the 6 

15 ribosomal operons (green), the tRNAs (black) and the cryptic mu-like 

prophage (blue). Fifth concentric circle: Simple tandem repeats. The 
locations of the following repeats are shown: CTGGCT, GTCT, ATT, 
AATGGC, TTGA, TTGG, TTTA, TTATC, TGAC, TCGTC, AACC, TTGC, 
CAAT, CCAA. The putative origin of replication is illustrated by the outward 

20 pointing arrows (green) originating near base 603,000. Two potential 

termination sequences are shown near the opposite midpoint of the circle (red). 



Figures 6(A)-6(D)- Complete map of the H. influenzae Rd genome. 
Predicted coding regions are shown on each strand. rRNA and tRNA genes 
are shown as lines and triangles, respectively. Genes are color-coded by role 
25 category as described in the legend. GenelD numbers correspond to those in 

Tables 1(a), 1(b) and 2. Where possible, three-letter designations are also 
provided. 



Figure 7 - A comparison of the region of the H. influenzae chromo- 
some containing the 8 genes of the fimbrial gene cluster present in H. 
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influenzae type b and the same region in H. influenzae Rd. The region is 
flanked by the pepN and purE genes in both organisms. However in the non- 
infectious Rd strain the 8 genes of the fimbrial gene cluster have been excised. 
A 172 bp spacer region is located in this region in the Rd strain and continues 
5 to be flanked by the pepN and purE genes. 

Figure 8 - Hydrophobicity analysis of five predicted channel-proteins. 
The amino acid sequences of five predicted coding regions that do not display 
homology with known peptide sequences (GenBank release 87), each exhibit 
multiple hydrophobic domains that are characteristic of channel-forming 
10 proteins. The predicted coding region sequences were analyzed by the Kyte- 

Doolittle algorithm (Kyte and Doolittle, J. Mol. Biol. 757:105 (1982)) (with 
a range of 11 residues) using the GeneWorks software package 
(Intelligenetics). 

Detailed Description of the Preferred Embodiments 



15 The present invention is based on the sequencing of the Haemophilus 

influenzae Rd genome. The primary nucleotide sequence which was generated 
is provided in SEQ ID NO: 1. As used herein, the "primary sequence" refers 
to the nucleotide sequence represented by the IUPAC nomenclature system. 
The sequence provided in SEQ ID NO: 1 is oriented relative to a unique 

20 Not I restriction endonuclease site found in the Haemophilus influenzae Rd 

genome. A skilled artisan will readily recognize that this start/stop point was 
chosen for convenience and does not reflect a structural significance. 

The present invention provides the nucleotide sequence of SEQ ID 
NO:l, or a representative fragment thereof, in a form which can be readily 

25 used, analyzed, and interpreted by a skilled artisan. In one embodiment, the 

sequence is provided as a contiguous string of primary sequence information 
corresponding to the nucleotide sequence provided in SEQ ID NO:l. 
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As used herein, a "representative fragment of the nucleotide sequence 
depicted in SEQ ID NO:l " refers to any portion of SEQ ID NO: 1 which is not 
presently represented within a publicly available database. Preferred 
representative fragments of the present invention are Haemophilus influenzae 
5 open reading frames, expression modulating fragments, uptake modulating 

fragments, and fragments which can be used to diagnose the presence of 
Haemophilus influenzae Rd in sample. A non-limiting identification of such 
preferred representative fragments is provided in Tables 1(a) and and 2. 

The nucleotide sequence information provided in SEQ ID NO: 1 was 

10 obtained by sequencing the Haemophilus influenzae Rd genome using a 

megabase shotgun sequencing method. Using three parameters of accuracy 
discussed in the Examples below, the present inventors have calculated that the 
sequence in SEQ ID NO:l has a maximum accuracy of 99.98%. Thus, the 
nucleotide sequence provided in SEQ ID NO: 1 is a highly accurate, although 

15 not necessarily a 100% perfect, representation of the nucleotide sequence of 

the Haemophilus influenzae Rd genome. 

As discussed in detail below, using the information provided in SEQ 
ID NO:l and in Tables 1(a) and 2 together with routine cloning and 
sequencing methods, one of ordinary skill in the art will be able to clone and 

20 sequence all "representative fragments" of interest including open reading 

frames (ORFs) encoding a large variety of Haemophilus influenzae proteins. 
In very rare instances, this may reveal a nucleotide sequence error present in 
the nucleotide sequence disclosed in SEQ ID NO: 1 . Thus, once the present 
invention is made available (i.e., once the information in SEQ ID NO: 1 and 

25 Tables 1(a) and 2 have been made available), resolving a rare sequencing error 

in SEQ ID NO: 1 will be well within the skill of the art. Nucleotide sequence 
editing software is publicly available. For example, Applied Biosystem's (AB) 
AutoAssembler™ can be used as an aid during visual inspection of nucleotide 
sequences. 
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Even if all of the very rare sequencing errors in SEQ ID NO: 1 were 
corrected, the resulting nucleotide sequence would still be at least 99.9% 
identical to the nucleotide sequence in SEQ ID NO:l. 

The nucleotide sequences of the genomes from different strains of 
5 Haemophilus influenzae differ slightly. However, the nucleotide sequence of 

the genomes of all Haemophilus influenzae strains will be at least 99.9% 
identical to the nucleotide sequence provided in SEQ ID NO: 1. 

Thus, the present invention further provides nucleotide sequences 
which are at least 99.9% identical to the nucleotide sequence of SEQ ID NO: 1 
10 in a form which can be readily used, analyzed and interpreted by the skilled 

artisan. Methods for determining whether a nucleotide sequence is at least 
99.9% identical to the nucleotide sequence of SEQ ID NO: 1 are routine and 
readily available to the skilled artisan. For example, the well known fasta 
algothrithm (Pearson and Lipman, Proc. Natl. Acad. Sci. USA 35:2444 
15 (1988)) can be used to generate the percent identity of nucleotide sequences. 



Computer Related Embodiments 

The nucleotide sequence provided in SEQ ID NO:l, a representative 
fragment thereof, or a nucleotide sequence at least 99.9% identical to SEQ ID 
NO: 1 may be "provided" in a variety of mediums to facilitate use thereof. As 
used herein, provided refers to a manufacture, other than an isolated nucleic 
acid molecule, which contains a nucleotide sequence of the present invention, 
i.e., the nucleotide sequence provided in SEQ ID NO:l, a representative 
fragment thereof, or a nucleotide sequence at least 99.9% identical to SEQ ID 
NO: 1. Such a manufacture provides the Haemophilus influenzae Rd genome 
or a subset thereof (e.g., a Haemophilus Influenzae Rd open reading frame 
(ORF)) in a form which allows a skilled artisan to examine the manufacture 
using means not directly applicable to examining the Haemophilus influenzae 
Rd genome or a subset thereof as it exists in nature or in purified form. 
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In one application of this embodiment, a nucleotide sequence of the 
present invention can be recorded on computer readable media. As used 
herein, "computer readable media" refers to any medium which can be read 
and accessed directly by a computer. Such media include, but are not limited 
to: magnetic storage media, such as floppy discs, hard disc storage medium, 
and magnetic tape; optical storage media such as CD-ROM; electrical storage 
media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. A skilled artisan can readily appreciate how 
any of the presently known computer readable mediums can be used to create 
a manufacture comprising computer readable medium having recorded thereon 
a nucleotide sequence of the present invention. 

As used herein, M recorded " refers to a process for storing information 
on computer readable medium. A skilled artisan can readily adopt any of the 
presently know methods for recording information on computer readable 
medium to generate manufactures comprising the nucleotide sequence 
information of the present invention. 

A variety of data storage structures are available to a skilled artisan for 
creating a computer readable medium having recorded thereon a nucleotide 
sequence of the present invention. The choice of the data storage structure 
will generally be based on the means chosen to access the stored information. 
In addition, a variety of data processor programs and formats can be used to 
store the nucleotide sequence information of the present invention on computer 
readable medium. The sequence information can be represented in a word 
processing text file, formatted in commercially-available software such as 
WordPerfect and Microsoft Word, or represented in the form of an ASCII 
file, stored in a database application, such as DB2, Sybase, Oracle, or the like. 
A skilled artisan can readily adapt any number of dataprocessor structuring 
formats (e.g. text file or database) in order to obtain computer readable 
medium having recorded thereon the nucleotide sequence information of the 
present invention. 
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By providing the nucleotide sequence of SEQ ID NO: 1, a 
representative fragment thereof, or a nucleotide sequence at least 99.9% 
identical to SEQ ID NO:l in computer readable form, a skilled artisan can 
routinely access the sequence information for a variety of purposes. Computer 
5 software is publicly available which allows a skilled artisan to access sequence 

information provided in a computer readable medium. The examples which 
follow demonstrate how software which implements the BLAST (Altschul 
et al.,J. Mol. Biol. 275:403-410 (1990)) and BLAZE (Brutlag et aL, Comp. 
Chem. 77:203-207 (1993)) search algorithms on a Sybase system was used to 
10 identify open reading frames (ORFs) within the Haemophilus influenzae Rd 

genome which contain homology to ORFs or proteins from other organisms. 
Such ORFs are protein encoding fragments within the Haemophilus influenzae 
Rd genome and are useful in producing commercially important proteins such 
as enzymes used in fermentation reactions and in the production of 
15 commercially useful metabolites. 

The present invention further provides systems, particularly computer- 
based systems, which contain the sequence information described herein. Such 
systems are designed to identify commercially important fragments of the 
Haemophilus influenzae Rd genome. 
20 As used herein, "a computer-based system" refers to the hardware 

means, software means, and data storage means used to analyze the nucleotide 
sequence information of the present invention. The minimum hardware means 
of the computer-based systems of the present invention comprises a central 
processing unit (CPU), input means, output means, and data storage means. 
25 A skilled artisan can readily appreciate that any one of the currently available 

computer-based system are suitable for use in the present invention. 

As stated above, the computer-based systems of the present invention 
comprise a data storage means having stored therein a nucleotide sequence of 
the present invention and the necessary hardware means and software means 
30 for supporting and implementing a search means. As used herein, "data 

storage means" refers to memory which can store nucleotide sequence 
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information of the present invention, or a memory access means which can 
access manufactures having recorded thereon the nucleotide sequence 
information of the present invention. 

As used herein, "search means" refers to one or more programs which 
5 are implemented on the computer-based system to compare a target sequence 

or target structural motif with the sequence information stored within the data 
storage means. Search means are used to identify fragments or regions of the 
Haemophilus influenzae Rd genome which match a particular target sequence 
or target motif. A variety of known algorithms are disclosed publicly and a 

10 variety of commercially available software for conducting search means are 

and can be used in the computer-based systems of the present invention. 
Examples of such software includes, but is not limited to, MacPattern 
(EMBL), BLASTN and BLASTX (NCBIA). A skilled artisan can readily 
recognize that any one of the available algorithms or implementing software 

15 packages for conducting homology searches can be adapted for use in the 

present computer-based systems. 

As used herein, a "target sequence" can be any DNA or amino acid 
sequence of six or more nucleotides or two or more amino acids. A skilled 
artisan can readily recognize that the longer a target sequence is, the less likely 

20 a target sequence will be present as a random occurrence in the database. The 

most preferred sequence length of a target sequence is from about 10 to 100 
amino acids or from about 30 to 300 nucleotide residues. However, it is well 
recognized that searches for commercially important fragments of the 
Haemophilus influenzae Rd genome, such as sequence fragments involved in 

25 gene expression and protein processing, may be of shorter length. 

As used herein, "a target structural motif," or "target motif," refers to 
any rationally selected sequence or combination of sequences in which the 
sequence(s) are chosen based on a three-dimensional configuration which is 
formed upon the folding of the target motif. There are a variety of target 

30 motifs known in the art. Protein target motifs include, but are not limited to, 

enzymic active sites and signal sequences. Nucleic acid target motifs include, 
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but are not limited to, promoter sequences, hairpin structures and inducible 
expression elements (protein binding sequences). 

A variety of structural formats for the input and output means can be 
used to input and output the information in the computer-based systems of the 
5 present invention. A preferred format for an output means ranks fragments of 

the Haemophilus influenzae Rd genome possessing varying degrees of 
homology to the target sequence or target motif. Such presentation provides 
a skilled artisan with a ranking of sequences which contain various amounts of 
the target sequence or target motif and identifies the degree of homology 
10 contained in the identified fragment. 

A variety of comparing means can be used to compare a target 
sequence or target motif with the data storage means to identify sequence 
fragments of the Haemophilus influenzae Rd genome. In the present 
examples, implementing software which implement the BLAST and BLAZE 
15 algorithms (Altschul et al., J. Mol. Biol. 2/5:403-410 (1990)) was used to 

identify open reading frames within the Haemophilus influenzae Rd genome. 
A skilled artisan can readily recognize that any one of the publicly available 
homology search programs can be used as the search means for the computer- 
based systems of the present invention. 
20 One application of this embodiment is provided in Figure 2. Figure 2 

provides a block diagram of a computer system 102 that can be used to 
implement the present invention. The computer system 102 includes a 
processor 106 connected to a bus 104. Also connected to the bus 104 are a 
main memory 108 (preferably implemented as random access memory, RAM) 
25 and a variety of secondary storage devices 1 10, such as a hard drive 1 12 and 

a removable medium storage device 114. The removable medium storage 
device 1 14 may represent, for example, a floppy disk drive, a CD-ROM drive, 
a magnetic tape drive, etc. A removable storage medium 116 (such as a 
floppy disk, a compact disk, a magnetic tape, etc.) containing control logic 
30 and/or data recorded therein may be inserted into the removable medium 

storage device 1 14. The computer system 102 includes appropriate software 
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for reading the control logic and/or the data from the removable medium 
storage device 1 14 once inserted in the removable medium storage device 1 14. 

A nucleotide sequence of the present invention may be stored in a well 
known manner in the main memory 108, any of the secondary storage devices 
110, and/or a removable storage medium 116. Software for accessing and 
processing the genomic sequence (such as search tools, comparing tools, etc.) 
reside in main memory 108 during execution. 

Biochemical Embodiments 

Another embodiment of the present invention is directed to isolated 
fragments of the Haemophilus influenzae Rd genome. The fragments of the 
Haemophilus influenzae Rd genome of the present invention include, but are 
not limited to fragments which encode peptides, hereinafter open reading 
frames (ORFs), fragments which modulate the expression of an operably 
linked ORF, hereinafter expression modulating fragments (EMFs), fragments 
which mediate the uptake of a linked DNA fragment into a cell, hereinafter 
uptake modulating fragments (UMFs), and fragments which can be used to 
diagnose the presence of Haemophilus influenzae Rd in a sample, hereinafter 
diagnostic fragments (DFs). 

As used herein, an "isolated nucleic acid molecule" or an "isolated 
fragment of the Haemophilus influenzae Rd genome" refers to a nucleic acid 
molecule possessing a specific nucleotide sequence which has been subjected 
to purification means to reduce, from the composition, the number of 
compounds which are normally associated with the composition. A variety of 
purification means can be used to generated the isolated fragments of the 
present invention. These include, but are not limited to methods which 
separate constituents of a solution based on charge, solubility, or size. 

In one embodiment, Haemophilus influenaze Rd DNA can be 
mechanically sheared to produce fragments of 15-20 kb in length. These 
fragments can then be used to generate an Haemophilus influenzae Rd library 
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by inserting them into labda clones as described in the Examples below. 
Primers flanking, for examiple, an ORF provided in Table 1(a) can then be 
generated using nucleotide sequence information provided in SEQ ID NO: 1 . 
PCR cloning can then be used to isolate the ORF from the lambda DNA 
5 library. PCR cloning is well known in the art. Thus, given the availability 

of SEQ ID NO:l, Table 1(a) and Table 2, it would be routine to isolate any 
ORF or other nucleic acid fragment of the present invention. 

The isolated nucleic acid molecules of the present invention include, 
but are not limited to single stranded and double stranded DNA, and single 

10 stranded RNA. 

As used herein, an "open reading frame," ORF, means a series of 
triplets coding for amino acids without any termination codons and is a 
sequence translatable into protein. Tables la, lb and 2 identify ORFs in the 
Haemophilus influenzae Rd genome. In particular, Table la indicates the 

15 location of ORFs within the Haemophilus influenzae genome which encode the 

recited protein based on homology matching with protein sequences from the 
organism appearing in parentheticals (see the fourth column of Table 1(a)). 

The first column of Table 1(a) provides the "GenelD" of a particular 
ORF. This information is useful for two reasons. First, the complete map of 

20 the Haemophilus influenzae Rd genome provided in Figures 6(A)-6(D) refers 

to the ORFs according to their GenelD numbers. Second, Table 1(b) uses the 
GenelD numbers to indicate which ORFs were provided previously in a public 
database. 

The second and third columns in Table 1(a) indicate an ORFs position 
25 in the nucleotide sequence provided in SEQ ID NO: 1 . One of ordinary skill 

will recognize that ORFs may be oriented in opposite directions in the 
Haemophilus influenae genome. This is reflected in columns 2 and 3. 

The fifth column of Table 1(a) indicates the percent identity of the 
protein encoded for by an ORF to the corresponding protein from the orgaism 
30 appearing in parentheticals in the fourth column. 
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The sixth column of Table 1(a) indicates the percent similarity of the 
protein encoded for by an ORF to the corresponding protein from the organism 
appearing in parentheticals in the fourth column. The concepts of percent 
identity and percent similarity of two polypeptide sequences is well understood 
5 in the art. For example, two polypeptides 10 amino acids in length which 

differ at three amino acid positions (e.g., at positions 1, 3 and 5) are said to 
have a percent identity of 70%. However, the same two polypeptides would 
be deemed to have a percent similarity of 80% if, for example at position 5, 
the amino acids moieties, although not identical, were "similar" (i.e., 
10 possessed similar biochemical characteristics). 

The seventh column in Table 1(a) indicates the lenth of the amino acid 
homology match. 

Table 2 provides ORFs of the Haemophilus influenzae Rd genome 
which encode polypeptide sequences which did not elicit a "homology match" 
15 with a known protein sequence from another organism. Further details 

concerning the algorithms and criteria used for homology searches are 
provided in the Examples below. 

A skilled artisan can readily identify ORFs in the Haemophilus 
influenzae Rd genome other than those listed in Tables 1(a), 1(b) and 2, such 
20 as ORFs which are overlapping or encoded by the opposite strand of an 

identified ORF in addition to those ascertainable using the computer-based 
systems of the present invention. 

As used herein, an "expression modulating fragment/ EMF, means a 
series of nucleotide molecules which modulates the expression of an operably 
25 linked ORF or EMF. 

As used herein, a sequence is said to "modulate the expression of an 
operably linked sequence" when the expression of the sequence is altered by 
the presence of the EMF. EMFs include, but are not limited to, promoters, 
and promoter modulating sequences (inducible elements). One class of EMFs 
30 are fragments which induce the expression or an operably linked ORF in 

response to a specific regulatory factor or physiological event. A review of 
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known EMFs from Haemophilus are described by (Tomb et al. Gene 104:1-10 
(1991), Chandler, M. S., Proc. Natl. Acad. Sci. USA 89:1626-1630 (1992). 

EMF sequences can be identified within the Haemophilus influenzae Rd 
genome by their proximity to the ORFs provided in Tables 1(a), 1(b) and 2. 
5 An intergenic segment, or a fragment of the intergenic segment, from about 

10 to 200 nucleotides in length, taken 5' from any one of the ORFs of 
Tables 1(a), 1(b), or 2 will modulate the expression of an operably linked 3' 
ORF in a fashion similar to that found with the naturally linked ORF 
sequence. As used herein, an "intergenic segment- refers to the fragments of 
10 the Haemophilus genome which are between two ORF(s) herein described. 

Alternatively, EMFs can be identified using known EMFs as a target sequence 
or target motif in the computer-based systems of the present invention. 

The presence and activity of an EMF can be confirmed using an EMF 
trap vector. An EMF trap vector contains a cloning site 5' to a marker 
15 sequence. A marker sequence encodes an identifiable phenotype, such as 

antibiotic resistance or a complementing nutrition auxotrophic factor, which 
can be identified or assayed when the EMF trap vector is placed within an 
appropriate host under appropriate conditions. As described above, a EMF 
will modulate the expression of an operably linked marker sequence. A more 
20 detailed discussion of various marker sequences is provided below. 

A sequence which is suspected as being a EMF is cloned in all three 
reading frames in one or more restriction sites upstream from the marker 
sequence in the EMF trap vector. The vector is then transformed into an 
appropriate host using known procedures and the phenotype of the transformed 
25 host in examined under appropriate conditions. As described above, an EMF 

will modulate the expression of an operably linked marker sequence. 

As used herein, an "uptake modulating fragment," UMF, means a 
series of nucleotide molecules which mediate the uptake of a linked DNA 
fragment into a cell. UMFs can be readily identified using known UMFs as 
30 a target sequence or target motif with the computer-based systems described 

above. 
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The presence and activity of a UMF can be confirmed by attaching the 
suspected UMF to a marker sequence. The resulting nucleic acid molecule is 
then incubated with an appropriate host under appropriate conditions and the 
uptake of the marker sequence is determined. As described above, a UMF 
5 will increase the frequency of uptake of a linked marker sequence. A review 

of DNA uptake in Haemophilus is provided by Goodgall, S.H. , et al. , J. Baa. 
772:5924-5928 (1990). 

As used herein, a "diagnostic fragment," DF, means a series of 
nucleotide molecules which selectively hybridize to Haemophilus influenzae 
10 sequences. DFs can be readily identified by identifying unique sequences 

within the Haemophilus influenzae Rd genome, or by generating and testing 
probes or amplification primers consisting of the DF sequence in an 
appropriate diagnostic format which determines amplification or hybridization 
selectivity. 

15 The sequences falling within the scope of the present invention are not 

limited to the specific sequences herein described, but also include allelic and 
species variations thereof. Allelic and species variations can be routinely 
determined by comparing the sequence provided in SEQ ID NO:l, a 
representative fragment thereof, or a nucleotide sequence at least 99.9% 

20 identical to SEQ ID NO: 1 with a sequence from another isolate of the same 

species. Furthermore, to accommodate codon variability, the invention 
includes nucleic acid molecules coding for the same amino acid sequences as 
do the specific ORFs disclosed herein. In other words, in the coding region 
of an ORF, substitution of one codon for another which encodes the same 

25 amino acid is expressly contemplated. 

Any specific sequence disclosed herein can be readily screened for errors 
by resequencing a particular fragment, such as an ORF, in both directions 
(i.e., sequence both strands). Alternatively, error screening can be performed 
by sequencing corresponding polynucleotides of Haemophilus influenzae origin 

30 isolated by using part or all of the fragments in question as a probe or primer. 
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Each of the ORFs of the Haemophilus influenzae Rd genome disclosed in 
Tables 1(a), 1(b) and 2, and the EMF found 5' to the ORF, can be used in 
numerous ways as polynucleotide reagents. The sequences can be used as 
diagnostic probes or diagnostic amplification primers to detect the presence of 
5 a specific microbe, such as Haemophilus influenzae RD, in a sample. This is 

especially the case with the fragments or ORFs of Table 2, which will be 
highly selective for Haemophilus influenzae. 

In addition, the fragments of the present invention, as broadly described, 
can be used to control gene expression through triple helix formation or 
10 antisense DNA or RNA, both of which methods are based on the binding of 

a polynucleotide sequence to DNA or RNA. Polynucleotides suitable for use 
in these methods are usually 20 to 40 bases in length and are designed to be 
complementary to a region of the gene involved in transcription (triple 
helix - see Lee et al. , Nucl Acids Res. 5:3073 (1979); Cooney et al. , Science 
15 241 :456 (1988); and Dervan et al. , Science 251 : 1360 ( 199 1)) or to the mRNA 

itself (antisense - Okano, J. Neurvchem. 56:560 (1991); Oligodeoxynucleotides 
as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL 
(1988)). 
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Triple helix- formation optimally results in a shut-off of RNA transcription 
from DNA, while antisense RNA hybridization blocks translation of an mRNA 
molecule into polypeptide. Both techniques have been demonstrated to be 
effective in model systems. Information contained in the sequences of the 
present invention is necessary for the design of an antisense or triple helix 
oligonucleotide. 

The present invention further provides recombinant constructs 
comprising one or more fragments of the Haemophilus influenzae Rd genome 
of the present invention. The recombinant constructs of the present invention 
comprise a vector, such as a plasmid or viral vector, into which a fragment of 
the Haemophilus influenzae Rd has been inserted, in a forward or reverse 
orientation. In the case of a vector comprising one of the ORFs of the present 
invention, the vector may further comprise regulatory sequences, including for 
example, a promoter, operably linked to the ORF. For vectors comprising the 
EMFs and UMFs of the present invention, the vector may further comprise a 
marker sequence or heterologous ORF operably linked to the EMF or UMF. 
Large numbers of suitable vectors and promoters are known to those of skill 
in the art and are commercially available for generating the recombinant 
constructs of the present invention. The following vectors are provided by 
way of example. Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs 
KS, pNH8a, pNH16a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, 
pKK233-3, pDR540, pRJT5 (Pharmacia). Eukaryotic: pWLneo, pSV2cat, 
pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. 
Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial 
promoters include lad, lacZ, T3, T7, gpt, lambda P R , and trc. Eukaryotic 
promoters include CMV immediate early, HSV thymidine kinase, early and 
late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of 
the appropriate vector and promoter is well within the level of ordinary skill 
in the art. 
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The present invention further provides host cells containing any one of 
the isolated fragments of the Haemophilus influenzae Rd genome of the present 
invention, wherein the fragment has been introduced into the host cell using 
known transformulation methods. The host cell can be a higher eukaryotic 
5 host cell, such as a mammalian cell, a lower eukaryotic host cell, such as a 

yeast cell, or the host cell can be a procaryotic cell, such as a bacterial cell. 
Introduction of the recombinant construct into the host cell can be effected by 
calcium phosphate transfection, DEAE, dextran mediated transfection, or 
electroporation (Davis, L. et al. , Basic Methods in Molecular Biology (1986)). 
IQ The host cells containing one of the fragments of the Haemophilus 

influenzae Rd genome of the present invention, can be used in conventional 
manners to produce the gene product encoded by the isolated fragment (in the 
case of an ORF) or can be used to produce a heterologous protein under the 
control of the EMF. 

15 The present invention further provides isolated polypeptides encoded 

by the nucleic acid fragments of the present invention or by degenerate 
variants of the nucleic acid fragments of the present invention. By "degenerate 
variant" is intended nucleotide fragments which differ from a nucleic acid 
fragment of the present invention (e.g. , an ORF) by nucleotide sequence but, 
20 due to the degeneracy of the Genetic Code, encode an identical polypeptide 

sequence. Preferred nucleic acid fragments of the present invention are the 
ORFs depicted in Table 1(a) which encode proteins. 

A variety of methodologies known in the art can be utilized to obtain 
any one of the isolated polypeptides or proteins of the present invention. At 
25 the simplest level, the amino acid sequence can be synthesized using 

commercially available peptide synthesizers. This is particularly useful in 
producing small peptides and fragments of larger polypeptides. Fragments are 
useful, for example, in generating antibodies against the native polypeptide. 
In an alternative method, the polypeptide or protein is purified from bacterial 
30 cells which naturally produce the polypeptide or protein. One skilled in the art 

can readily follow known methods for isolating polpeptides and proteins in 
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order to obtain one of the isolated polypeptides or proteins of the present 
invention. These include, but are not limited to, immunochromatography, 
HPLC, size-exclusion chromatography, ion-exchange chromatography, and 
immuno-affinity chromatography. 
5 The polypeptides and proteins of the present invention can alternatively 

be purified from cells which have been altered to express the desired 
polypeptide or protein. As used herein, a cell is said to be altered to express 
a desired polypeptide or protein when the cell, through genetic manipulation, 
is made to produce a polypeptide or protein which it normally does not 
10 produce or which the cell normally produces at a lower level. One skilled in 

the art can readily adapt procedures for introducing and expressing either 
recombinant or synthetic sequences into eukaryotic or prokaryotic cells in 
order to generate a cell which produces one of the polypeptides or proteins of 
the present invention. 

15 Any host/ vector system can be used to express one or more of the 

ORFs of the present invention. These include, but are not limited to, 
eukaryotic hosts such as HeLa cells, Cv-1 cell, COS cells, and Sf9 cells, as 
well as prokaryotic host such as E. coli and B. subtilis. The most preferred 
cells are those which do not normally express the particular polypeptide or 

20 protein or which expresses the polypeptide or protein at low natural level. 

"Recombinant," as used herein, means that a polypeptide or protein is 
derived from recombinant (e.g., microbial or mammalian) expression systems. 
"Microbial" refers to recombinant polypeptides or proteins made in bacterial 
or fungal (e.g., yeast) expression systems. As a product, "recombinant 

25 microbial" defines a polypeptide or protein essentially free of native 

endogenous substances and unaccompanied by associated native glycosylation. 
Polypeptides or proteins expressed in most bacterial cultures, e.g., £. coli, 
will be free of glycosylation modifications; polypeptides or proteins expressed 
in yeast will have a glycosylation pattern different from that expressed in 

30 mammalian cells. 
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"Nucleotide sequence" refers to a heteropolymer of 
deoxyribonucleotides. Generally, DNA segments encoding the polypeptides 
and proteins provided by this invention are assembled from fragments of the 
Haemophilus influenzae Rd genome and short oligonucleotide linkers, or from 
5 a series of oligonucleotides, to provide a synthetic gene which is capable of 

being expressed in a recombinant transcriptional unit comprising regulatory 
elements derived from a microbial or viral operon. 

"Recombinant expression vehicle or vector" refers to a plasmid or phage 
or virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. 
10 The expression vehicle can comprise a transcriptional unit comprising an 

assembly of (1) a genetic element or elements having a regulatory role in gene 
expression, for example, promoters or enhancers, (2) a structural or coding 
sequence which is transcribed into mRNA and translated into protein, and 
(3) appropriate transcription initiation and termination sequences. Structural 
15 units intended for use in yeast or eukaryotic expression systems preferably 

include a leader sequence enabling extracellular secretion of translated protein 
by a host cell. Alternatively, where recombinant protein is expressed without 
a leader or transport sequence, it may include an N-terminal methionine 
residue. This residue may or may not be subsequently cleaved from the 
20 expressed recombinant protein to provide a final product. 

"Recombinant expression system" means host cells which have stably 
integrated a recombinant transcriptional unit into chromosomal DNA or carry 
the recombinant transcriptional unit extra chromosomally. The cells can be 
prokaryotic or eukaryotic. Recombinant expression systems as defined herein 
25 will express heterologous polypeptides or proteins upon induction of the 

regulatory elements linked to the DNA segment or synthetic gene to be 
expressed. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, 
or other cells under the control of appropriate promoters. Cell-free translation 
30 systems can also be employed to produce such proteins using RNAs derived 

from the DNA constructs of the present invention. Appropriate cloning and 
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expression vectors for use with prokaryatic and eukaryotic hosts are described 
by Sambrook, et ah , in Molecular Cloning: A Laboratory Manual, Second 
Edition, Cold Spring Harbor, New York (1989), the disclosure of which is 
hereby incorporated by reference. 
5 Generally, recombinant expression vectors will include origins of 

replication and selectable markers permitting transformation of the host cell, 
e.g., the ampicillin resistance gene of E. coli and 5. cerevisiae TRP1 gene, 
and a promoter derived from a highly-expressed gene to direct transcription of 
a downstream structural sequence. Such promoters can be derived from 

10 operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase 

(PGK), a- factor, acid phosphatase, or heat shock proteins, among others. The 
heterologous structural sequence is assembled in appropriate phase with 
translation initiation and termination sequences, and preferably, a leader 
sequence capable of directing secretion of translated protein into the 

15 periplasmic space or extracellular medium. Optionally, the heterologous 

sequence can encode a fusion protein including an N-terminal identification 
peptide imparting desired characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. 

Useful expression vectors for bacterial use are constructed by inserting a 

20 structural DNA sequence encoding a desired protein together with suitable 

translation initiation and termination signals in operable reading phase with a 
functional promoter. The vector will comprise one or more phenotypic 
selectable markers and an origin of replication to ensure maintenance of the 
vector and to, if desirable, provide amplification within the host. Suitable 

25 prokaryotic hosts for transformation include E. coli, Bacillus subtilis. 

Salmonella typhimurium and various species within the genera Pseudomonas, 
Streptomyces, and Staphylococcus, although others may, also be employed as 
a matter of choice. 

As a representative but nonlimiting example, useful expression vectors for 

30 bacterial use can comprise a selectable marker and bacterial origin of 

replication derived from commercially available plasmids comprising genetic 
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elements of the well known cloning vector pBR322 (ATCC 37017). Such 
commercial vectors include, for example, pKK223-3 (Pharmacia Fine 
Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotec, Madison, WI, 
USA). These pBR322 "backbone" sections are combined with an appropriate 
5 promoter and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host 
strain to an appropriate cell density, the selected promoter is derepressed by 
appropriate means (e.g., temperature shift or chemical induction) and cells are 
cultured for an additional period. Cells are typically harvested by 
10 centrifugation, disrupted by physical or chemical means, and the resulting 

crude extract retained for further purification. 

Various mammalian cell culture systems can also be employed to 
express recombinant protein. Examples of mammalian expression systems 
include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman, 
15 Cell 23: 175 (1981), and other cell lines capable of expressing a compatible 

vector, for example, the C127, 3T3, CHO, Hel^ and BHK cell lines. 
Mammalian expression vectors will comprise an origin of replication, a 
suitable promoter and enhancer, and also any necessary ribosome binding sites, 
polyadenylation site, splice donor and acceptor sites, transcriptional 
20 termination sequences, and 5' flanking nontranscribed sequences. DNA 

sequences derived from the SV40 viral genome, for example, SV40 origin, 
early promoter, enhancer, splice, and polyadenylation sites may be used to 
provide the required nontranscribed genetic elements. 

Recombinant polypeptides and proteins produced in bacterial culture 
25 is usually isolated by initial extraction from cell pellets, followed by one or 

more salting-out, aqueous ion exchange or size exclusion chromatography 
steps. Protein refolding steps can be used, as necessary, in completing 
configuration of the mature protein. Finally, high performance liquid 
chromatography (HPLC) can be employed for final purification steps. 
30 Microbial cells employed in expression of proteins can be disrupted by any 
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convenient method, including freeze- thaw cycling, sonication, mechanical 
disruption, or use of cell lysing agents. 

The present invention further includes isolated polypeptides, proteins 
and nucleic acid molecules which are substantially equivalent to those herein 
5 described. As used herein, substantially equivalent can refer both to nucleic 

acid and amino acid sequences, for example a mutant sequence, that varies 
from a reference sequence by one or more substitutions, deletions, or 
additions, the net effect of which does not result in an adverse functional 
dissimilarity between reference and subject sequences. For purposes of the 

10 present invention, sequences having equivalent biological activity, and 

equivalent expression characteristics are considered substantially equivalent. 
For purposes of determining equivalence, truncation of the mature sequence 
should be disregarded. 

The invention further provides methods of obtaining homologs from 

15 other strains of Haemophilus influenzae, of the fragments of the Haemophilus 

influenzae Rd genome of the present invention and homologs of the proteins 
encoded by the ORFs of the present invention. As used herein, a sequence or 
protein of Haemophilus influenzae is defined as a homolog of a fragment of the 
Haemophilus influenzae Rd genome or a protein encoded by one of the ORFs 

20 of the present invention, if it shares significant homology to one of the 

fragments of the Haemophilus influenzae Rd genome of the present invention 
or a protein encoded by one of the ORFs of the present invention. 
Specifically, by using the sequence disclosed herein as a probe or as primers, 
and techniques such as PCR cloning and colony/plaque hybridization, one 

25 skilled in the art can obtain homologs. 

As used herein, two nucleic acid molecules or proteins are said to 
"share significant homology" if the two contain regions which process greater 
than 85% sequence (amino acid or nucleic acid) homology. 

Region specific primers or probes derived from the nucleotide sequence 

30 provided in SEQ ID NO:l or from a nucleotide sequence at least 99.9% 

identical to SEQ ID NO: 1 can be used to prime DNA synthesis and PCR 
amplification, as well as to identify colonies containing cloned DNA encoding 
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a homolog using known methods (Innis et ai, PCR Protocols, Academic 
Press, San Diego, CA (1990)). 

When using primers derived from SEQ ID NO: 1 or from a nucleotide 
sequence at least 99.9% identical to SEQ ID NO: 1 , one skilled in the art wiU 
5 recognize that by employing high stringency conditions (e.g. , annealing at 50- 

60°C) only sequences which are greater than 75 % homologous to the primer 
will be amplified. By employing lower stringency conditions (e.g. , annealing 
at 35-37°C), sequences which are greater than 40-50% homologous to the 
primer will also be amplified. 
10 When using DNA probes derived from SEQ ID NO:l or from a 

nucleotide sequence at least 99.9% identical to SEQ ID NO:l for 
colony/plaque hybridization, one skilled in the art will recognize that by 
employing high stringency conditions (e.g., hybridizing at 50-65°C in 5X 
SSPC and 50% formamide, and washing at 50-65°C in 0.5X SSPC), 
15 sequences having regions which are greater than 90% homologous to the probe 

can be obtained, and that by employing lower stringency conditions (e.g., 
hybridizing at 35-37°C in 5X SSPC and 40-45% formamide, and washing at 
42°C in SSPC), sequences having regions which are greater than 35-45% 
homologous to the probe will be obtained. 
20 Any organism can be used as the source for homologs of the present 

invention so long as the organism naturally expresses such a protein or 
contains genes encoding the same. The most preferred organism for isolating 
homologs are bacterias which are closely related to Haemophilus influenzae 
Rd. 



25 Uses for the Compositions of the Invention 



30 



Each ORF provided in Table 1(a) was assigned to one of 102 biological 
role categories adapted from Riley, M., Microbiology Reviews 57(4):S62 
(1993)). This allows the skilled artisan to determine a use for each identified 
coding sequence. Tables 1(a) further provides an identification of the type of 
polypeptide which is encoded for by each ORF. As a result, one skilled in the 
art can use the polypeptides of the present invention for commercial, 
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therapeutic and industrial purposes consistent with the type of putative 
identification of the polypeptide. 

Such identifications permit one skilled in the art to use the Haemophilus 
influenzae ORFs in a manner similar to the known type of sequences for which 
5 the identification is made; for example, to ferment a particular sugar source 

or to produce a particular metabolite. (For a review of enzymes used within 
the commercial industry, see Biochemical Engineering and Biotechnology 
Handbook 2nd, eds. Macmillan Publ. Ltd., NY (1991) and Biocatalysts in 
Organic Syntheses, ed. J. Tramper et al., Elsevier Science Publishers, 
10 Amsterdam, The Netherlands (1985)). 

1. Biosynthetic Enzymes 

Open reading frames encoding proteins involved in mediating the 
catalytic reactions involved in intermediary and macromolecular metabolism, 
the biosynthesis of small molecules, cellular processes and other functions 
includes enzymes involved in the degradation of the intermediary products of 
metabolism, enzymes involved in central intermediary metabolism, enzymes 
involved in respiration, both aerobic and anaerobic, enzymes involved in 
fermentation, enzymes involved in ATP proton motor force conversion, 
enzymes involved in broad regulatory function, enzymes involved in amino 
acid synthesis, enzymes involved in nucleotide synthesis, enzymes involved in 
cofactor and vitamin synthesis, can be used for industrial biosynthesis. The 
various metabolic pathways present in Haemophilus can be identified based on 
absolute nutritional requirements as well as by examining the various enzymes 
identified in Table 1(a). 

Identified within the category of intermediary metabolism, a number 
of the proteins encoded by the identified ORFs in Tables 1(a) are particularly 
involved in the degradation of intermediary metabolites as well as non- 
macromolecular metabolism. Some of the enzymes identified include 
amylases, glucose oxidases, and catalase. 
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Proteolytic enzymes are another class of commercially important 
enzymes. Proteolytic enzymes find use in a number of industrial processes 
including the processing of flax and other vegetable fibers, in the extraction, 
clarification and depectinization of fruit juices, in the extraction of vegetables' 

5 oil and in the maceration of fruits and vegetables to give unicellular fruits. A 

detailed review of the proteolytic enzymes used in the food industry is 
provided by Rombouts et aL, Symbiosis 27:79 (1986) and Voragen et aL in 
Biocataiyst in Agricultural Biotechnology, edited J.R. Whi taker et aL, 
American Chemical Society Symposium Series 389:93 (1989)). 

10 The metabolism of glucose, galactose, fructose and xylose are 

important parts of the primary metabolism of Haemophilus. Enzymes involved 
in the degradation of these sugars can be used in industrial fermentation. 
Some of the important sugar transforming enzymes, from a commercial 
viewpoint, include sugar isomerases such as glucose isomerase. Other 

15 metabolic enzymes have found commercial use such as glucose oxidases which 

produces ketogulonic acid (KGA). KGA is an intermediate in the commercial 
production of ascorbic acid using the Reichstein's procedure (see Krueger 
et a/., Biotechnology 6(A) , Rhine, H.J. et aL, eds., Verlag Press, Weinheim, 
Germany (1984)). 

20 Glucose oxidase (GOD) is commercially available and has been used 

in purified form as well as in an immobilized form for the deoxygenation of 
beer. See Hartmeir et al., Biotechnology Letters 7:21 (1979). The most 
important application of GOD is the industrial scale fermentation of gluconic 
acid. Market for gluconic acids which are used in the detergent, textile, 

25 leather, photographic, pharmaceutical, food, feed and concrete industry (see 

Bigelis in Gene Manipulations and Fungi, Benett, J. W. et aL , eds. , Academic 
Press, New York (1985), p. 357). In addition to industrial applications, GOD 
has found applications in medicine for quantitative determination of glucose 
in body fluids recently in biotechnology for analyzing syrups from starch and 

30 cellulose hydrosylates. See Owusu et aL, Biochem. et Biophysica. Acta. 

872:83 (1986). 
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The main sweetener used in the world today is sugar which comes from 
sugar beets and sugar cane. In the field of industrial enzymes, the glucose 
isomerase process shows the largest expansion in the market today. Initially, 
soluble enzymes were used and later immobilized enzymes were developed 
5 (Krueger et al , Biotechnology, The Textbook of Industrial Microbiology, 

Sinauer Associated Incorporated, Sunderland, Massachusetts (1990)). Today, 
the use of glucose-produced high fructose syrups is by far the largest industrial 
business using immobilized enzymes. A review of the industrial use of these 
enzymes is provided by Jorgensen, Starch 40:307 (1988). 

10 Proteinases, such as alkaline serine proteinases, are used as detergent 

additives and thus represent one of the largest volumes of microbial enzymes 
used in the industrial sector. Because of their industrial importance, there is 
a large body of published and unpublished information regarding the use of 
these enzymes in industrial processes. (See Faultman et aL , Acid Proteases 

15 Structure Function and Biology, Tang, J., ed., Plenum Press, New York 

(1977) and Godfrey et al. , Industrial Enzymes, MacMillan Publishers, Surrey, 
UK (1983) and Hepner et al. , Report Industrial Enzymes by 1990, Hel Hepner 
& Associates, London (1986)), 

Another class of commercially usable proteins of the present invention 

20 are the microbial lipases identified in Table 1 (see Macrae et aL , Philosophical 

Transactions of the Chiral Society of London 310:227 (1985) and Poserke, 
Journal of the American Oil Chemist Society 61: 1758 (1984). A major use of 
lipases is in the fat and oil industry for the production of neutral glycerides 
using lipase catalyzed inter-esterification of readily available triglycerides. 

25 Application of lipases include the use as a detergent additive to facilitate the 

removal of fats from fabrics in the course of the washing procedures. 

The use of enzymes, and in particular microbial enzymes, as catalyst 
for key steps in the synthesis of complex organic molecules is gaining 
popularity at a great rate. One area of great interest is the preparation of chiral 

30 intermediates. Preparation of chiral intermediates is of interest to a wide range 

of synthetic chemists particularly those scientists involved with the preparation 
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of new pharmaceuticals, agrochernicals, fragrances and flavors. (See Davies 
et aL, Recent Advances in the Generation of Chiral Intermediates Using 
Enzymes, CRC Press, Boca Raton, Florida (1990)). The following reactions 
catalyzed by enzymes are of interest to organic chemists: hydrolysis of 
5 carboxylic acid esters, phosphate esters, amides and nitriles, esterification 

reactions, trans-esterification reactions, synthesis of amides, reduction of 
alkanones and oxoalkanates, oxidation of alcohols to carbonyl compounds, 
oxidation of sulfides to sulfoxides, and carbon bond forming reactions such as 
the aldol reaction. When considering the use of an enzyme encoded by one 
10 of the ORFs of the present invention for biotransformation and organic 

synthesis it is sometimes necessary to consider the respective advantages and 
disadvantages of using a microorganism as opposed to an isolated enzyme. 
Pros and cons of using a whole cell system on the one hand or an isolated 
partially purified enzyme on the other hand, has been described in detail by 
15 Bud et al. , Chemistry in Britain (1987), p. 127. 

Amino transferases, enzymes involved in the biosynthesis and 
metabolism of amino acids, are useful in the catalytic production of amino 
acids. The advantages of using microbial based enzyme systems is that the 
amino transferase enzymes catalyze the stereo-selective synthesis of only 
20 /-amino acids and generally possess uniformly high catalytic rates. A 

description of the use of amino transferases for amino acid production is 
provided by RoseUe-David, Methods ofEnzymology 136A19 (1987). 

Another category of useful proteins encoded by the ORFs of the present 
invention include enzymes involved in nucleic acid synthesis, repair, and 
25 recombination. A variety of commercially important enzymes have previously 

been isolated from members of Haemophilus sp. These include the Hinc II, 
Hind III, and Hinf I restriction endonucleases. Table 1(a) identifies a wide 
array of enzymes, such as restriction enzymes, ligases, gyrases and 
methylases, which have immediate use in the biotechnology industry. 



30 



2. Generation of Antibodies 
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As described here, the proteins of the present invention, as well as 
homologs thereof, can be used in a variety procedures and methods known in 
the art which are currently applied to other proteins. The proteins of the 
present invention can further be used to generate an antibody which selectively 
5 binds the protein. Such antibodies can be either monoclonal or polyclonal 

antibodies, as well fragments of these antibodies, and humanized forms. 

The invention further provides antibodies which selectively bind to one 
of the proteins of the present invention and hybridomas which produce these 
antibodies. A hybridoma is an immortalized cell line which is capable of 

10 secreting a specific monoclonal antibody. 

In general, techniques for preparing polyclonal and monoclonal 
antibodies as well as hybridomas capable of producing the desired antibody are 
well known in the art (Campbell, A.M., Monoclonal Antibody Technology: 
Laboratory Techniques in Biochemistry and Molecular Biology , Elsevier 

15 Science Publishers, Amsterdam, The Netherlands (1984); St. Groth et al , 7. 

Immunol Methods 55:1-21 (1980); Kohler and Milstein, Nature 256:495-497 
(1975)), the trioma technique, the human B-ceil hybridoma technique (Kozbor 
et al. , Immunology Today 4:72 (1983); Cole et al. , in Monoclonal Antibodies 
and Cancer Therapy, Alan R. Liss, Inc. (1985), pp. 77-96). 

20 Any animal (mouse, rabbit, etc*) which is known to produce antibodies 

can be immunized with the pseudogene polypeptide. Methods for 
immunization are well known in the art. Such methods include subcutaneous 
or interperitoneal injection of the polypeptide* One skilled in the art will 
recognize that the amount of the protein encoded by the ORF of the present 

25 invention used for immunization will vary based on the animal which is 

immunized, the antigenicity of the peptide and the site of injection. 

The protein which is used as an immunogen may be modified or 
administered in an adjuvant in order to increase the protein's antigenicity. 
Methods of increasing the antigenicity of a protein are well known in the art 

30 and include, but are not limited to coupling the antigen with a heterologous 
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protein (such as globulin or 0-galactosidase) or through the inclusion of an 
adjuvant during immunization. 

For monoclonal antibodies, spleen cells from the immunized animals 
are removed, fused with myeloma cells, such as SP2/0-Agl4 myeloma cells, 
and allowed to become monoclonal antibody producing hybridoma cells. 

Any one of a number of methods well known in the art can be used to 
identify the hybridoma cell which produces an antibody with the desired 
characteristics. These include screening the hybridomas with an ELISA assay, 
western blot analysis, or radioimmunoassay (Lutz et aL, Exp. Cell Res. 
775:109-124 (1988)). 

Hybridomas secreting the desired antibodies are cloned and the class 
and subclass is determined using procedures known in the art (Campbell, 
A.M., Monoclonal Antibody Technology: Laboratory Techniques in 
Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, 
The Netherlands (1984)). 

Techniques described for the production of single chain antibodies 
(U.S. Patent 4,946,778) can be adapted to produce single chain antibodies to 
proteins of the present invention. 

For polyclonal antibodies, antibody containing an ti sera is isolated from 
the immunized animal and is screened for the presence of antibodies with the 
desired specificity using one of the above-described procedures. 

The present invention further provides the above-described antibodies 
in detectably labelled form. Antibodies can be detectably labelled through the 
use of radioisotopes, affinity labels (such as biotin, avidin, etc.), enzymatic 
labels (such as horseradish peroxidase, alkaline phosphatase, etc.) fluorescent 
labels (such as FTTC or rhodamine, etc.), paramagnetic atoms, etc. 
Procedures for accomplishing such labelling are well-known in the art, for 
example see (Sternberger, L.A. et al., J. Histochem. Cytochem. 78:315 
(1970); Bayer, E.A. etal. 9 Meth. Enzym. 52:308 (1979); Engval, E. etal., 
Immunol 109:129 (1972); Goding, J.W. 7. Immunol. Meth. 75:215 (1976)). 
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The labeled antibodies of the present invention can be used for in vitro , 
in vivo, and in situ assays to identify cells or tissues in which a fragment of the 
Haemophilus influenzae Rd genome is expressed. 

The present invention further provides the above-described antibodies 
5 immobilized on a solid support. Examples of such solid supports include 

plastics such as polycarbonate, complex carbohydrates such as agarose and 
sepharose, acrylic resins and such as polyacrylamide and latex beads. 
Techniques for coupling antibodies to such solid supports are well known in 
the art (Weir, D.M. et al. , "Handbook of Experimental Immunology" 4th Ed., 
10 Blackwell Scientific Publications, Oxford, England, Chapter 10 (1986); 

Jacoby, W.D. et aL, Meth. Enzym. 34 Academic Press, N.Y. (1974)). The 
immobilized antibodies of the present invention can be used for in vitro, in 
vivo, and in situ assays as well as for immunoaffinity purification of the 
proteins of the present invention. 

15 3. Diagnostic Assays and Kits 

The present invention further provides methods to identify the 
expression of one of the ORFs of the present invention, or homolog thereof, 
in a test sample, using one of the DFs or antibodies of the present invention. 

In detail, such methods comprise incubating a test sample with one or 
20 more of the antibodies or one or more of the DFs of the present invention and 

assaying for binding of the DFs or antibodies to components within the test 
sample. 

Conditions for incubating a DF or antibody with a test sample vary. 
Incubation conditions depend on the format employed in the assay, the 
25 detection methods employed, and the type and nature of the DF or antibody 

used in the assay. One skilled in the art will recognize that any one of the 
commonly available hybridization, amplification or immunological assay 
formats can readily be adapted to employ the DFs or antibodies of the present 
invention. Examples of such assays can be found in Chard, T., An 
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Introduction to Radioimmunoassay and Related Techniques, Hsevier Science 
Publishers, Amsterdam, The Netherlands (1986); Bullock, G.R. et al.. 
Techniques in Immunocytochemistry, Academic Press, Orlando, FL Vol. 1 
(1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice and Theory of 

5 Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular 

Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1985). 

The test samples of the present invention include cells, protein or 
membrane extracts of cells, or biological fluids such as sputum, blood, serum, 
plasma, or urine. The test sample used in the above-described method will 

10 vary based on the assay format, nature of the detection method and the tissues, 

cells or extracts used as the sample to be assayed. Methods for preparing 
protein extracts or membrane extracts of cells are well known in the art and 
can be readily be adapted in order to obtain a sample which is compatible with 
the system utilized. 

15 in another embodiment of the present invention, kits are provided 

which contain the necessary reagents to carry out the assays of the present 
invention. 

Specifically, the invention provides a compartmentalized kit to receive, 
in close confinement, one or more containers which comprises: (a) a first 
20 container comprising one of the DFs or antibodies of the present invention; 

and (b) one or more other containers comprising one or more of the following: 
wash reagents, reagents capable of detecting presence of a bound DF or 
antibody. 

In detail, a compartmentalized kit includes any kit in which reagents 
25 are contained in separate containers. Such containers include small glass 

containers, plastic containers or strips of plastic or paper. Such containers 
allows one to efficiently transfer reagents from one compartment to another 
compartment such that the samples and reagents are not cross-contaminated, 
and the agents or solutions of each container can be added in a quantitative 
30 fashion from one compartment to another. Such containers will include a 

container which will accept the test sample, a container which contains the 
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antibodies used in the assay, containers which contain wash reagents (such as 
phosphate buffered saline, Tris-buffers, etc.), and containers which contain the 
reagents used to detect the bound antibody or DF. 

Types of detection reagents include labelled nucleic acid probes, 
5 labelled secondary antibodies, or in the alternative, if the primary antibody is 

labelled, the enzymatic, or antibody binding reagents which are capable of 
reacting with the labelled antibody. One skilled in the art will readily 
recognize that the disclosed DFs and antibodies of the present invention can 
be readily incorporated into one of the established kit formats which are well 
10 known in the art. 



4. Screening Assay for Binding Agents 



Using the isolated proteins of the present invention, the present 
invention further provides methods of obtaining and identifying agents which 
bind to a protein encoded by one of the ORFs of the present invention or to 
15 one of the fragments and the Haemophilus genome herein described. 

In detail, said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by one of 
the ORFs of the present invention, or an isolated fragment of 
the Haemophilus genome; and 
20 (b) determining whether the agent binds to said protein or said 

fragment. 

The agents screened in the above assay can be, but are not limited to, 
peptides, carbohydrates, vitamin derivatives, or other pharmaceutical agents. 
The agents can be selected and screened at random or rationally selected or 
25 designed using protein modeling techniques. 

For random screening, agents such as peptides, carbohydrates, 
pharmaceutical agents and the like are selected at random and are assayed for 
their ability to bind to the protein encoded by the ORF of the present 
invention. 
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Alternatively, agents may be rationally selected or designed. As used 
herein, an agent is said to be "rationally selected or designed" when the agent 
is chosen based on the configuration of the particular protein. For example, 
one skilled in the art can readily adapt currently available procedures to 
5 generate peptides, pharmaceutical agents and the like capable of binding to a 

specific peptide sequence in order to generate rationally designed antipeptide 
peptides, for example see Hurby et al.. Application of Synthetic Peptides: 
Antisense Peptides," In Synthetic Peptides. A User's Guide, W.H. Freeman, 
NY (1992), pp. 289-307, and Kaspczak et al., Biochemistry 28:9230-8 (1989), 
10 or pharmaceutical agents, or the like. 

In addition to the foregoing, one class of agents of the present 
invention, as broadly described, can be used to control gene expression 
through binding to one of the ORFs or EMFs of the present invention. As 
described above, such agents can be randomly screened or rationally 
15 designed/selected. Targeting the ORF or EMF allows a skilled artisan to 

design sequence specific or element specific agents, modulating the expression 
of either a single ORF or multiple ORFs which rely on the same EMF for 
expression control. 

One class of DNA binding agents are agents which contain base 
20 residues which hybridize or form a triple helix formation by binding to DNA 

or RNA. Such agents can be based on the classic phosphodiester, ribonucleic 
acid backbone, or can be a variety of sulfhydryl or polymeric derivatives 
which have base attachment capacity. 

Agents suitable for use in these methods usually contain 20 to 40 bases 
25 and are designed to be complementary to a region of the gene involved in 

transcription (triple helix - see Lee et al. , Nucl. Acids Res. 6:3073 (1979); 
Cooney etal., Science 241A56 (1988); and Dervan et al., Science 251: 1360 
(1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560 
(1991); Oligodeoxy nucleotides as Antisense Inhibitors of Gene Expression, 
30 CRC Press, Boca Raton, FL (1988)). Triple helix- formation optimally results 

in a shut-off of RNA transcription from DNA, while antisense RNA 
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hybridization blocks translation of an mRNA molecule into polypeptide. Both 
techniques have been demonstrated to be effective in model systems. 
Information contained in the sequences of the present invention is necessary 
for the design of an antisense or triple helix oligonucleotide and other DNA 

5 binding agents. 

Agents which bind to a protein encoded by one of the ORFs of the 
present invention can be used as a diagnostic agent, in the control of bacterial 
infection by modulating the activity of the protein encoded by the ORF. 
Agents which bind to a protein encoded by one of the ORFs of the present 

0 invention can be formulated using known techniques to generate a 

pharmaceutical composition for use in controlling Haemophilus growth and 
infection. 

5. Vaccine and Pharmaceutical Composition 

The present invention further provides pharmaceutical agents which can 
5 be used to modulate the growth of Haemophilus influenzae, or another related 

organism, in vivo or in vitro. As used herein, a "pharmaceutical agent" is 
defined as a composition of matter which can be formulated using known 
techniques to provide a pharmaceutical compositions. As used herein, the 
"pharmaceutical agents of the present invention" refers the pharmaceutical 
) agents which are derived from the proteins encoded by the ORFs of the present 

invention or are agents which are identified using the herein described assays. 

As used herein, a pharmaceutical agent is said to "modulated the 
growth of Haemophilus sp. , or a related organism, in vivo or in vitro, M when 
the agent reduces the rate of growth, rate of division, or viability of the 
> organism in question. The pharmaceutical agents of the present invention can 

modulate the growth of an organism in many fashions, although an 
understanding of the underlying mechanism of action is not needed to practice 
the use of the pharmaceutical agents of the present invention. Some agents 
will modulate the growth by binding to an important protein thus blocking the 
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biological activity of the protein, while other agents may bind to a component 
of the outer surface of the organism blocking attachment or rendering the 
organism more prone to act the bodies nature immune system. Alternatively, 
the agent may be comprise a protein encoded by one of the ORFs of the 
5 present invention and serve as a vaccine. The development and use of a 

vaccine based on outer membrane components, such as the LPS, are well 
known in the art. 

As used herein, a "related organism" is a broad term which refers to 
any organism whose growth can be modulated by one of the pharmaceutical 
10 agents of the present invention. In general, such an organism will contain a 

homolog of the protein which is the target of the pharmaceutical agent or the 
protein used as a vaccine. As such, related organism do not need to be 
bacterial but may be fungal or viral pathogens. 

The pharmaceutical agents and compositions of the present invention 
15 may be administered in a convenient manner such as by the oral, topical, 

intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal or 
intradermal routes. The pharmaceutical compositions are administered in an 
amount which is effective for treating and/or prophylaxis of the specific 
indication. In general, they are administered in an amount of at least about 10 
20 Mg /kg body weight and in most cases they will be administered in an amount 

not in excess of about 8 mg/Kg body weight per day. In most cases, the 
dosage is from about 10 j*g/kg to about 1 mg/kg body weight daily, taking into 
account the routes of administration, symptoms, etc. 

The agents of the present invention can be used in native form or can 
25 be modified to form a chemical derivative. As used herein, a molecule is said 

to be a "chemical derivative" of another molecule when it contains additional 
chemical moieties not normally a part of the molecule. Such moieties may 
improve the molecule's solubility, absorption, biological half life, etc. The 
moieties may alternatively decrease the toxicity of the molecule, eliminate or 
30 attenuate any undesirable side effect of the molecule, etc. Moieties capable of 



-42- 

SUBSTTTUTE SHEET (RULE 26) 



BNSOOCID- <WO 9633276A1J_> 



WO 96/33276 



PCT/US96/05320 



mediating such effects are disclosed in Remington's Pharmaceutical Sciences 
(1980). 

For example, a change in the immunological character of the functional 
derivative, such as affinity for a given antibody, is measured by a competitive 
5 type immunoassay. Changes in immunomodulation activity are measured by 

the appropriate assay. Modifications of such protein properties as redox or 
thermal stability, biological half-life, hydrophobicity, susceptibility to 
proteolytic degradation or the tendency to aggregate with carriers or into 
mul timers are assayed by methods well known to the ordinarily skilled artisan. 

10 The therapeutic effects of the agents of the present invention may be 

obtained by providing the agent to a patient by any suitable means (i.e., 
inhalation, intravenously, intramuscularly, subcutaneously, enterally, or 
parenterally). It is preferred to administer the agent of the present invention 
so as to achieve an effective concentration within the blood or tissue in which 

15 the growth of the organism is to be controlled. 

To achieve an effective blood concentration, the preferred method is 
to administer the agent by injection. The administration may be by continuous 
infusion, or by single or multiple injections. 

In providing a patient with one of the agents of the present invention, 

20 the dosage of the administered agent will vary depending upon such factors as 

the patient's age, weight, height, sex, general medical condition, previous 
medical history, etc. In general, it is desirable to provide the recipient with 
a dosage of agent which is in the range of from about 1 pg/kg to 10 mg/kg 
(body weight of patient), although a lower or higher dosage may be admin - 

25 istered. The therapeutically effective dose can be lowered by using 

combinations of the agents of the present invention or another agent. 

As used herein, two or more compounds or agents are said to be 
administered "in combination" with each other when either ( 1 ) the 
physiological effects of each compound, or (2) the serum concentrations of 

30 each compound can be measured at the same time. The composition of the 
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present invention can be administered concurrently with, prior to, or following 
the administration of the other agent. 

The agents of the present invention are intended to be provided to 
recipient subjects in an amount sufficient to decrease the rate of growth (as 
5 defined above) of the target organism. 

The administration of the agent(s) of the invention may be for either a 
"prophylactic" or "therapeutic" purpose. When provided prophylactically, the 
agent(s) are provided in advance of any symptoms indicative of the organisms 
growth. The prophylactic administration of the agent(s) serves to prevent, 
10 attenuate, or decrease the rate of onset of any subsequent infection. When 

provided therapeutically, the agent(s) are provided at (or shortly after) the 
onset of an indication of infection. The therapeutic administration of the 
compound(s) serves to attenuate the pathological symptoms of the infection and 
to increase the rate of recovery. 
15 The agents of the present invention are administered to the mammal in 

a pharmaceutical^ acceptable form and in a therapeutically effective 
concentration. A composition is said to be "pharmacologically acceptable" if 
its administration can be tolerated by a recipient patient. Such an agent is said 
to be administered in a "therapeutically effective amount" if the amount 
20 administered is physiologically significant. An agent is physiologically 

significant if its presence results in a detectable change in the physiology of a 
recipient patient. 

The agents of the present invention can be formulated according to 
known methods to prepare pharmaceutically useful compositions, whereby 

25 these materials, or their functional derivatives, are combined in admixture with 

a pharmaceutically acceptable carrier vehicle. Suitable vehicles and their 
formulation, inclusive of other human proteins, e.g., human serum albumin, 
are described, for example, in Remington's Pharmaceutical Sciences (16th ed., 
Osol, A., Ed., Mack, Eastern PA (1980)). In order to form a pharmaceutically 

30 acceptable composition suitable for effective administration, such compositions 
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will contain an effective amount of one or more of the agents of the present 
invention, together with a suitable amount of carrier vehicle. 

Additional pharmaceutical methods may be employed to control the 
duration of action. Control release preparations may be achieved through the 
5 use of polymers to complex or absorb one or more of the agents of the present 

invention. The controlled delivery may be exercised by selecting appropriate 
macro molecules (for example polyesters, polyamino acids, polyvinyl, 
pyrrolidone, ethylenevinylacetate, methylcellulose, carboxymethylcellulose, 
or protamine, sulfate) and the concentration of macromolecules as well as the 

10 methods of incorporation in order to control release. Another possible method 

to control the duration of action by controlled release preparations is to 
incorporate agents of the present invention into particles of a polymeric 
material such as polyesters, polyamino acids, hydrogels, poly (lactic acid) or 
ethylene vinylacetate copolymers. Alternatively, instead of incorporating these 

15 agents into polymeric particles, it is possible to entrap these materials in 

microcapsules prepared, for example, by coacervation techniques or by 
interfacial polymerization, for example, hydroxymethylcellulose or gelatine- 
microcapsules and poly(methylmethacylate) microcapsules, respectively, or in 
colloidal drug delivery systems, for example, liposomes, albumin 

20 microspheres, microemulsions, nanoparticles, and nanocapsules or in 

macroemulsions. Such techniques are disclosed in Remington's 
Pharmaceutical Sciences (1980). 

The invention further provides a pharmaceutical pack or kit comprising 
one or more containers filled with one or more of the ingredients of the 

25 pharmaceutical compositions of the invention. Associated with such 

containers) can be a notice in the form prescribed by a governmental agency 
regulating the manufacture, use or sale of pharmaceuticals or biological 
products, which notice reflects approval by the agency of manufacture, use or 
sale for human administration. In addition, the agents of the present invention 

30 may be employed in conjunction with other therapeutic compounds. 
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6. Shot-Gun Approach to Megabase DNA Sequencing 

The present invention further provides the first demonstration 
that a sequence of greater than one megabase can be sequenced using a random 
shotgun approach. This procedure, described in detail in the examples that 
follow, has eliminated the up front cost of isolating and ordering overlapping 
or contiguous subclones prior to the start of the sequencing protocols. 

Certain aspects of the present invention are described in greater detail 
in the non-limiting Examples that follow. 
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Examples 
Experimental Design and Methods 

1. Shotgun Sequencing Strategy 

The overall strategy for a shotgun approach to whole genome 
5 sequencing is outlined in Table 3. The theory of shotgun sequencing follows 

from the Lander and Waterman (Landerman and Waterman, Genomics 2: 231 
(1988)) application of the equation for the Poisson distribution p x = mWx!, 
where x is the number of occurrences of an event, m is the mean number of 
occurrences, and p x is the probability that any given base is not sequenced after 

10 a certain amount of random sequence has been generated. If L is the genome 

length, n is the number of clone insert ends sequenced, and w is the 
sequencing read length, then m = nw/L, and the probability that no clone 
originates at any of the w bases preceding a given base, i.e., the probability 
that the base is not sequenced, is p 0 = e*™. Using the fold coverage as the unit 

15 for m, one sees that after 1.8 Mb of sequence has been randomly generated, 

m = 1, representing IX coverage. In this case, p 0 = e* 1 = .37, thus 
approximately 37% is unsequenced. For example, 5X coverage 
(approximately 9500 clones sequenced from both insert ends and an average 
sequence read length of 460 bp) yields p 0 = e 5 = 0.0067, or 0.67% 

20 unsequenced. The total gap length is Le*", and the average gap size is L/n. 

5X coverage would leave about 128 gaps averaging about 100 bp in size. The 
treatment is essentially that of Lander and Waterman, Genomics 2:231 (1988). 
Table 4 illustrates the coverage for a 1.9 Mb genome with an average fragment 
size of 460 bp. 
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2. Random Library Construction 

In order to approximate the random model described above during 
actual sequencing, a nearly ideal library of cloned genomic fragment is 
required. The following library construction procedure was developed to 

5 achieve this. 

H. influenzae Rd KW20 DNA was prepared by phenol extraction. A 
mixture (3.3 ml) containing 600 *xg DNA, 300 mM sodium acetate, 10 mM 
Tris-HCl, 1 mM Na-EDTA, 30% glycerol was sonicated (Branson Model 450 
Sonicator) at the lowest energy setting for 1 min. at 0° using a 3 mm probe. 
10 The DNA was ethanol precipitated and redissolved in 500 fil TE buffer. To 

create blunt-ends, a 100 pi aliquot was digested for 10 min at 30° in 200 ^1 
BAL31 buffer with 5 units BAL31 nuclease (New England BioLabs). The 
DNA was phenol-extracted, ethanol-precipitated, redissolved in 100 /*1 TE 
buffer, electrophoresed on a 1.0% low melting agarose gel, and the 1.6-2.0 
15 kb size fraction was excised, phenol-extracted, and redissolved in 20 pi TE 

buffer. A two-step ligation procedure was used to produce a plasmid library 
with 97% insert of which >99% were single inserts. The first ligation 
mixture (50 pi) contained 2 pg of DNA fragments, 2 pg Smal/BAP pUC18 
DNA (Pharmacia), and 10 units T4 ligase (GIBCO/BRL), and incubation was 
20 at 14° for 4 hr. After phenol extraction and ethanol precipitation, the DNA 

was dissolved in 20 pi TE buffer and electrophoresed on a 1.0% low melting 
agarose gel. A ladder of ethidium bromide-stained linear bands, identified by 
size as insert (i), vector (v), v+i, v+2i, v+3i, ... was visualized by 360 nm 
UV light, and the v+i DNA was excised and recovered in 20 pi TE. The v+i 
25 DNA was blunt-ended by T4 polymerase treatment for 5 min. at 37° in a 

reaction mixture (50 pi) containing the v+i linears, 500 pM each of the 4 
dNTP's, and 9 units of T4 polymerase (New England BioLabs) under 
recommended buffer conditions. After phenol extraction and ethanol 
precipitation the repaired v+i linears were dissolved in 20 pi TE. The final 
30 ligation to produce circles was carried out in a 50 pi reaction containing 5 pi 

of v+i linears and 5 units of T4 ligase at 14° overnight. After 10 min. at 70° 
the reaction mixture was stored at -20°. _ 4 g_ 
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This two-stage procedure resulted in a molecularly random collection 
of single-insert plasmid recombinants with minimal contamination from 
double-insert chimeras ( < 1 %) or free vector ( < 3 %). Since deviation from 
randomness is most likely to occur during cloning, E. coli host cells deficient 
5 in all recombination and restriction functions (A. Greener, Strategies 3 (1):5 

(1990)) were used to prevent rearrangements, deletions, and loss of clones by 
restriction. Transformed cells were plated directly on antibiotic diffusion 
plates to avoid the usual broth recovery phase which allows multiplication and 
selection of the most rapidly growing cells. Plating occured as follows: 

10 A 100 /il aliquot of Epicurian Coli SURE II Supercompetent Cells 

(Stratagene 200152) was thawed on ice and transferred to a chilled Falcon 
2059 tube on ice. A 1.7 ^1 aliquot of 1.42 M (J-mercaptoethanol was added 
to the aliquot of cells to a final concentration of 25 mM. Cells were incubated 
on ice for 10 min. A 1 /xl aliquot of the final ligation was added to the cells 

15 and incubated on ice for 30 min. The cells were heat pulsed for 30 sec. at 42° 

and placed back on ice for 2 min. The outgrowth period in liquid culture was 
eliminated from this protocol in order to minimize the preferential growth of 
any given transformed cell. Instead the transformation were plated directly on 
a nutrient rich SOB plate containing a 5 ml bottom layer of SOB agar 

20 (1.5% SOB agar: 20 g tryptone, 5 g yeast extract, 0.5 g NaCl, 1.5% Difco 

Agar/L). The 5 ml bottom layer is supplemented with 0.4 ml ampicillin 
(50 mg/ml)/100 ml SOB agar. The 15 ml top layer of SOB agar is 
supplemented with 1 ml X-Gal (2%), 1 ml MgCl 2 (1 M), and 1 ml 
MgSO 4 /100 ml SOB agar. The 15 ml top layer was poured just prior to 

25 plating. Our titer was approximately 100 colonies/ 10 id aliquot of 

transformation. 

All colonies were picked for template preparation regardless of size. 
Only clones lost due to "poison" DNA or deleterious gene products would be 
deleted from the library, resulting in a slight increase in gap number over that 
30 expected. 
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In order to evaluate the quality of the H. influenzae library, sequence 
data were obtained from approximately 4000 templates using the Ml 3-21 
primer. The random sequence fragments were assembled using the 
AutoAssembler™ software (Applied Biosystems division of Perkin-Elmer 

5 (AB)) after obtaining 1300, 1800, 2500, 3200, and 3800 sequence fragments, 

and the number of unique assembled base pairs was determined. Based on the 
equations described above, an ideal plot of the number of base pairs remaining 
to be sequenced as a function of the # of sequenced fragments obtained with 
an average read length of 460 bp for a 2.5X10 6 and a 1 .9X10 6 bp genome was 

10 determined (Figure 3). The progression of assembly was plotted using the 

actual data obtained from the assembly of up to 3800 sequence fragments and 
compared the data that is provided in the ideal plot (Figure 3). Figure 3 
illustrates that there was essentially no deviation of the actual assembly data 
from the ideal plot, indicating that we had constructed close to an ideal random 

15 library with minimal contamination from double insert chimeras and free of 

vector. 



3. Random DNA Sequencing 



High quality double stranded DNA plasmid templates (19,687) were 
prepared using a "boiling bead" method developed in collaboration with 

20 Advanced Genetic Technology Corp. (Gaithersburg, MD) (Adams et al. f 

Science 252:1651 (1991); Adams et al. f Nature 355:632 (1992)). Plamid 
preparation was performed in a 96-well format for all stages of DNA 
preparation from bacterial growth through final DNA purification. Template 
concentration was determined using Hoechst Dye and a Millipore Cytofluor. 

25 DNA concentrations were not adjusted, but low-yielding templates were 

identified where possible and not sequenced. Templates were also prepared 
from two H. influenzae lambda genomic libraries. An amplified library was 
constructed in vector Lambda GEM- 12 (Promega) and an unamplified library 
was constructed in Lambda DASH II (Stratagene). In particular, for the 
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unamplified lambda library, H. influenzae Rd KW20 DNA (> 100 kb) was 
partially digested in a reaction mixture (200 fil) containing 50 jxg DNA, IX 
Sau3Al buffer, 20 units Sau3AI for 6 min. at 23°. The digested DNA was 
phenol-extracted and electrophoresed on a 0.5% low melting agarose gel at 
5 2V/cm for 7 hours. Fragments from 15 to 25 kb were excised and recovered 

in a final volume of 6 /il. One fti of fragments was used with 1 ftl of DASHII 
vector (Stratagene) in the recommended ligation reaction. One /zl of the 
ligation mixture was used per packaging reaction following the recommended 
protocol with the Gigapack II XL Packaging Extract (Stratagene, #227711). 

10 Phage were plated directly without amplification from the packaging mixture 

(after dilution with 500 pA of recommended SM buffer and chloroform 
treatment). Yield was about 2.5X10 3 pfu//*l. The amplified library was 
prepared essentially as above except the lambda GEM- 12 vector was used. 
After packaging, about 3.5x10* pfu were plated on the restrictive NM539 host. 

15 The lysate was harvested in 2 ml of SM buffer and stored frozen in 7% 

dimethylsulfoxide. The phage titer was approximately lxlO 9 pfu/ml. 

Liquid lysates (10 ml) were prepared from randomly selected plaques 
and template was prepared on an anion-exchange resin (Qiagen). Sequencing 
reactions were carried out on plasmid templates using the AB Catalyst 

20 LabStation with Applied Biosystems PRISM Ready Reaction Dye Primer 

Cycle Sequencing Kits for the M 13 forward (Ml 3-21) and the M 13 reverse 
(M13RP1) primers (Adams et al. t Nature 568:474 (1994)). Dye terminator 
sequencing reactions were carried out on the lambda templates on a Perkin- 
Elmer 9600 Thermocycler using the Applied Biosystems Ready Reaction Dye 

25 Terminator Cycle Sequencing kits. T7 and SP6 primers were used to sequence 

the ends of the inserts from the Lambda GEM- 12 library and T7 and T3 
primers were used to sequence the aids of the inserts from the Lambda DASH 
II library. Sequencing reactions (28,643) were performed by eight individuals 
using an average of fourteen AB 373 DNA Sequencers per day over a 3 month 

30 period. All sequencing reactions were analyzed using the Stretch modification 

of the AB 373, primarily using a 34 cm well-to-read distance. The overall 
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sequencing success rate was 84% for Ml 3-21 sequences, 83% for M13RP1 
sequences and 65% for dye-terminator reactions. The average usable read 
length was 485 bp for M13-21 sequences, 444 bp for M13RP1 sequences, and 
375 bp for dye-terminator reactions. Table 5 summarizes the high-throughput 

5 sequencing phase of the invention. 

Richards et al (Richards ee al, Automated DNA sequencing and 
Analysis, M.D. Adams, C. Fields, J.C. Venter, Eds. (Academic Press, 
London, 1994), Chap. 28.) described the value of using sequence from both 
ends of sequencing templates to facilitate ordering of contigs in shotgun 

10 assembly projects of lambda and cosmid clones. We balanced the desirability 

of both-end sequencing (including the reduced cost of lower total number of 
templates) against shorter read-lengths for sequencing reactions performed with 
the M13RP1 (reverse) primer compared to the Ml 3-21 (forward) primer. 
Approximately one-half of the templates were sequenced from both ends. In 

15 total, 9,297 M13RP1 sequencing reactions were done. Random reverse 

sequencing reactions were done based on successful forward sequencing 
reactoins. Some M13RP1 sequences were obtained in a semi-directed fashion: 
Ml 3-21 sequences pointing outward at the ends of contigs were chosen for 
M13RP1 sequencing in an effort to specifically order contigs. The semi- 

20 directed strategy was effective, and clone-based ordering formed an integral 

part of assembly and gap closure (see below). 

4. Protocol for Automated Cycle Sequencing 

The sequencing consisted of using eight ABI Catalyst robots and 
fourteen AB 373 Automated DNA Sequencers. The Catalyst robot is a 
25 publicly available sophisticated pipetting and temperature control robot which 

has been developed specifically for DNA sequencing reactions. The Catalyst 
combines pre-aliquoted templates and reaction mixes consisting of deoxy- and 
dideoxynucleotides, the Taq thermostable DNA polymerase, fluorescently- 
labelled sequencing primers, and reaction buffer. Reaction mixes and 
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templates were combined in the wells of an aluminum 96-well thermocycling 
plate. Thirty consecutive cycles of linear amplification (e.g., one primer 
synthesis) steps were performed including denaturation, annealing of primer 
and template, and extension of DNA synthesis. A heated lid with rubber 
5 gaskets on the thermocycling plate prevented evaporation without the need for 

an oil overlay. 

Two sequencing protocols were used: dye-labelled primers and dye- 
labelled dideoxy chain terminators. The shotgun sequencing involves use of 
four dye-labelled sequencing primers, one for each of the four terminator 

10 nucleotide. Each dye-primer is labelled with a different fluorescent dye, 

permitting the four individual reactions to be combined into one lane of the 
373 DNA Sequencer for electrophoresis, detection, and base-calling. AB 
currently supplies pre- mixed reaction mixes in bulk packages containing all the 
necessary non-template reagents for sequencing. Sequencing can be done with 

15 both plasmid and PCR-generated templates with both dye-primers and dye- 

terminators with approximately equal fidelity, although plasmid templates 
generally give longer usable sequences. 

Thirty-two reactions were loaded per 373 Sequencer each day, for a 
total of 960 samples. Electrophoresis was run overnight following the 

20 manufacture's protocols, and the data was collected for twelve hours. 

Following electrophoresis and fluorescence detection, the AB 373 performs 
automatic lane tracking and base-calling. The lane-tracking was confirmed 
visually. Each sequence electropherogram (or fluorescence lane trace) was 
inspected visually and assessed for quality. Trailing sequences of low quality 

25 were removed and the sequence itself was loaded via software to a Sybase 

database (archived daily to a 8mm tape). Leading vector poly linker sequence 
was removed automatically by software program. Average edited lengths of 
sequences from the standard ABI 373 were around 400 bp and depended 
mostly on the quality of the template used for the sequencing reaction. All of 

30 the ABI 373 Sequencers were converted to Stretch Liners, which provided a 
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longer electrophoresis path prior to fluorescence detection, thus increasing the 
average number of usable bases to 500-600 bp. 

Informatics 

1. Data Management 

5 A number of information management systems (LIMA) for a large- 

scale sequencing lab have been developed (Kerlavage et al. , Proceedings of the 
Twenty-Sixth Annual Hawaii International Conference on System Sciences, 
tfff. Computer Society Press, Washington D.C., 585 (1993)). The system 
used to collect and assemble the sequence data was developed using the Sybase 

10 relational data management system and was designed to automate data flow 

whereever possible and to reduce user error. The database stores and 
correlates all information collected during the entire operation from template 
preparation to final analysis of the genome. Because the raw output of the AB 
373 Sequencers was based on a Macintosh platform and the data management 

15 system chosen was based on a Unix platform, it was necessary to design and 

implement a variety of multi-user, client server applications which allow the 
raw data as weU as analysis results to flow seamlessly into the database with 
a minimum of user effort. A description of the software programs used for 
large sequence assembly and managment is provided in Figure 4. 

20 2. Assembly 

An assembly engine (T1GR Assembler) was developed for the rapid 
and accurate assembly of thousands of sequence fragments. The AB 
AutoAssembler™ was modified (and named TIGR Editor) to provide a 
graphical interface to the electropherogram for the purpose of editing data 
25 associated with the aligned sequence file output of TIGR Assembler. TIGR 

Editor maintains synchrony between the electropherogram files on the 
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Macintosh platform and the sequence data in the H. influenzae database on the 
Unix platform. 

The TIGR assembler simultaneously clusters and assembles fragments 
of the genome. In order to obtain the speed necessary to assemble more than 
5 10 4 fragments, the algorithm builds a hash table of 10 bp oligonucleotide 

subsequences to generate a list of potential sequence fragment overlaps. The 
number of potential overlaps for each fragment determines which fragments 
are likely to fall into repetitive elements. Beginning with a single seed 
sequence fragment, TIGR Assembler extends the current contig by attempting 

10 to add the best matching fragment based on oligonucleotide content. The 

current contig and candidate fragment are aligned using a modified version of 
the Smith- Waterman algorithm (Waterman, M.S., Methods in Enzymology 
164:165 (1988)) which provides for optimal gapped alignments. The current 
contig is extended by the fragment only if strict criteria for the quality of the 

15 match are met. The match criteria include the minimum length of overlap, the 

maximum length of an unmatched end, and the minimum percentage match. 
These criteria are automatically lowered by the algorithm in regions of 
minimal coverage and raised in regions with a possible repetitive element. The 
number of potential overlaps for each fragment determines which fragments 

20 are likely to fall into repetitive elements. Fragments representing the 

boundaries of repetitive elements and potentially chimeric fragments are often 
rejected based on partial mismatches at the ends of alignments and excluded 
from the current contig. TIGR Assembler is designed to take advantage of 
clone size information coupled with sequencing from both ends of each 

25 template. It enforces the constraint that sequence fragments from two ends of 

the same template point toward one another in the contig and are located 
within a certain ranged of base pairs (definable for each clone based on the 
known clone size range for a given library). Assembly of 24,304 sequence 
fragments of H. influenzae required 30 hours of CPU time using one processor 

30 on a SPARCenter 2000 with 512 Mb of RAM. This process resulted in 

approximately 210 contigs. Because of the high stringency of the TIGR 
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Assembler, all contigs were searched against each other using grasta (a 
modified fasta (Person and Lipman, Proc. Nail. Acad. Sci. U.S.A. 85:2444 
(1988)). In this way, additional overlaps were detected which enabled 
compression oof the data set into 140 contigs. The location of each fragment 
5 in the contigs and extensive information about the consensus sequence itself 

were loaded into the H. influenzae relational database. 

3. Ordering Assembled Contigs 

After assembly the relative positions of the 140 contigs were unknown. 
The contigs were ordered by asm align. Asm align uses a number of 
10 relationships to identify and align contigs that are adjacent to each other. 

Using this algorithm, the 140 contigs were placed into 42 groups totaling 42 
physical gaps (no template DNA for the region) and 98 sequence gaps 
(template available for gap closure). 

Ordering Contigs Separated by Physical Gaps and Achieving Closure 

15 Four integrated strategies were developed to order contigs separated by 

physical gaps. Oligonucleotide primers were designed and synthesized from 
the end of each contig group. These primers were then available for use in 
one or more of the strategies outlined below: 

1. Southern analysis was done to develop a unique "fingerprint" 

20 for a subset of 72 of the above oligonucleotides. This procedure was based 

upon the supposition that labeled oligonucleotides homologous to the ends of 
adjacent contigs should hybridize to common DNA restriction fragments, and 
thus share a similar or identical hybridization pattern or "fingerprint". 
Oligonucleotides were labeled using 50 pmoles of each 20 mer and 250 mCi 

25 of [y- 32 P]ATP and T4 polynucleotide kinase. The labeled oligonucleotides 

were purified using Sephadex G-25 superfine (Pharmacia) and 107 cpm of 
each was used in a Southern hybridization analysis of H. influenzae Rd 
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chromosomal DNA digested with one frequent cutters (Asel) and five less 
frequent cutters (Bgm, EcoRl, PstI, Xbal, and PvuU). The DNA from each 
digest was fractionated on a 0.7% agarose gel arid transferred to Nytran Plus 
nylon membranes (Schleicher & Schuell). Hybridization was carried out for 
5 16 hours at 40°. To remove non-specific signals, each blot was sequentially 

washed at room temperature with increasingly stringent conditions up to 0. IX 
SSC + 0.5% SDS. Blots were exposed to a Phosphorlmager cassette 
(Molecular Dynamics) for several hours and hybridization patterns were 
visually compared. 

10 Adjacent contigs identified in this manner were targeted for specific 

PGR reactions. 

2. Peptide links were made by searching each contig end using 
blastx (Altschul et ai, J. MoL Biol. 215:403 (1990)) against a peptide 
database. If the ends of two contigs matched the same database sequence in 

15 an appropriate manner, then the two contigs were tentatively considered to be 

adjacent to each other. 

3. The two lambda libraries constructed from H. influenaze 
genomic DNA were probed with oligonucleotides designed from the ends of 
contig groups (Kirkness et al., Genomics 70:985 (1991)). The positive 

20 plaques were then used to prepare templates and the sequence was determined 

from each end of the lambda clone insert. These sequence fragments were 
searched using grasta against a database of all contigs. Two contigs that 
matched the sequence from the opposite ends of the same lambda clone were 
ordered. The lambda clone then provided the template for closure of the 

25 sequence gap between the adjacent contigs. The lambda clones were especially 

valuable for solving repeat structures. 

4. To confirm the order of contigs found by the other approaches 
and establish the order of non-ordered contigs, standard and long range (XL) 
PCR reactions were performed as follows. 

30 Standard PCR was performed in the following manner. Each reaction 

contained a 37 M l cocktail; 16.5 fil H 2 0, 3 ^1 25 mM MgCl 2 , 8 pi of a dNTP 
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mix (1.25 mM each dNTP), 4.5 ill 10X PCR core buffer II (Perkin Elmer), 
25 ng H. influenzae Rd KW20 genomic DNA. The appropriate two primers 
(4 pi, 3.2 pmole/fxl) were added to each reaction. A hot start was performed 
at 95° for 5 min followed by a 75° hold. During the hold Amplitaq DNA 
5 polymerase (Perkin Elmer) 0.3 pi in 4.3 ill H 2 0, 0.5 ill 10X PCR core buffer 

n, was added to each reaction. The PCR profile was 25 cycles of 94°/45 sec. , 
denature; 55°/ 1 min., anneal; 72°/3 min, extension. All reactions were 
performed in a 96 well format on a Perkin Elmer Gene Amp PCR System 
9600. 

10 Long range PCR (XL PCR) was performed as follows: Each reaction 

contained a 35.2 ill cocktail; 12.0 /d H 2 0, 2.2 ill 25 mM Mg(OAc) 2 , 4 pi of 
a dNTP mix (200 iiM final concentration), 12.0 ^1 3.3X PCR buffer, 25 ng 
H. influenzae Rd KW20 genomic DNA. The appropriate two primers (5 /il, 
3.2 pmoles//xl) was added to each reaction. A hot start was performed at 94° 

15 for 1 minute. rTth polymerase, 2.0 fd (4 U/reaction) in 2.8 id 3.3X PCR 

buffer II was added to each reaction. The PCR profile was 18 cycles of 
94°/ 15 sec., denature; 62°/8 min., anneal and extend followed by 12 cycles 
94°/ 15 sec., denature; 62°/8 min. (increase 15 sec. /cycle), anneal and extend; 
72°/ 10 min., final extension. All reactions were performed in a 96 well 

20 format on a Perkin Elmer GeneAmp PCR System 9600. 

Although a PCR reaction was performed for essentially every 
combination of physical gap ends, techniques such as Southern fingerprinting, 
database matching, and the probing of large insert clones were particularly 
valuable in ordering con tigs adjacent to each other and reducing the number 

25 of combinatorial PCR reactions necessary to achieve complete gap closure. 

Employing these strategies to an even greater extent in future genome projects 
will increase the overall efficiency of complete genome closure. The number 
of physical gaps ordered and closed by each of these techniques is summarized 
in Table 5. 

30 Sequence information from the ends of 15-20 kb clones is particularly 

suitable for gap closure, solving repeat structures, and providing general 
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confirmation of the overall genome assembly. We were also concerned that 
some fragments of the H. influenaze genome would be non-clonable in a high 
copy plasmid in E. coli. We reasoned that lytic lambda clones would provide 
the DNA for these segments. Approximately 100 random plaques were picked 
5 from the amplified lambda library, templates prepared, and sequence 

information obtained from each end. These sequences were searched (grasta) 
against the contigs and linked in the database to their appropriate con tig, thus 
providing a scaffolding of lambda clones contributing additional support to the 
accuracy of the genome assembly (Fig 5). In addition to confirmation of the 

10 contig structure, the lambda clones provided closure for 23 physical gaps. 

Approximately 78% of the genome is covered by lambda clones. 

Lambda clones were also useful for solving repeat structures. Repeat 
structures identified in the genome were small enough to be spanned by a 
single clone from the random insert library, except for the six ribosomal RNA 

15 operons and one repeat (2 copies) which was 5,340 bp in length. 

Oligonucleotide probes were designed from the unique flanks at the beginning 
of each repeat and hybridized to the lambda libraries. Positive plaques were 
identified for each flank and the sequence fragments from the ends of each 
clone were used to correctly orient the repeats within the genome. 

20 The ability to distinguish and assemble the six ribosomal RNA (rRNA) 

operons of H. influenaze (16S subunit-23S subunit-5S subunit) was a test of 
our overall strategy to sequence and assemble a complex genome which might 
contain a significant number of repeat regions. The high degree of sequence 
similarity and the length of the six operons caused the assembly process to 

25 cluster all the underlying sequences into a few indistinguishable contigs. To 

determine the correct placement of the operons in the sequence, a pair of 
unique flanking sequences was required for each. No unique flanking 
sequences could be found at the left (16S rRNA) ends. This region contains 
the ribosomal promoter and appeared to be non-clonable in the high copy 

30 number pUC18 plasmid. However, unique sequences could be identified at 

the right (5S) ends. Oligonucleotide primers were designed from these six 
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flanking regions and used to probe the two lambda libraries. For each of the 
six rRNA operons at least one positive plaque was identified which completely 
spanned the rRNA operon and contained unique flanking sequence at the 16S 
and 5S ends. These plaques provided the templates for obtaining the unique 
5 sequence for each of the six rRNA operons. 

An additional confirmation of the global structure of the assembled 
circular genome was obtained by comparing a computer generated restriction 
map based on the assembled sequence for the enzymes Apal, Smal, and Rsrll 
with the predicted physical map of Redfield and Lee (Genetic Maps; locus 
10 maps of complex genomes, S.J. O'Brien, Ed. Cold Spring Harbor Laboratory 

Press, New York, N.Y., 1990, 2110.). The restriction fragments from the 
sequence-derived map matched those from the physical map in size and 
relative order (Figure 5). 

Editing 

Simultaneous with the final gap filling process, each contig was edited 
visually by reassembling overlapping 10 kb sections of contigs using the AB 
AutoAssembler™ and the Fast Data Finder™ hardware. AutoAssembler™ 
provides a graphical interface to electropherogram data for editing. The 
electropherogram data was used to assign the most likely base at each position. 
Where a discrepancy could not be resolved or a clear assignment made, the 
automatic base calls were left unchanged. Individual sequence changes were 
written to the electropherogram files and a replication protocol (crash) was 
used to maintain the synchrony of sequence data between the H. influenzae 
database and the electropherogram files. Following editing, contigs were 
reassembled with TIGR Assembler prior to annotation. 

Potential frameshifts identified in the course of annotating the genome 
were saved as reports in the database. These reports include the coordinates 
in a contig which the alignment software (praze) predicts to be the most likely 
location of a missing or inserted base and a representation of the sequence 
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alignment containing the frameshift. Apparent frameshifts were used to 
indicate areas of the sequence which may require further editing. Frameshifts 
were not corrected in cases where clear electropherogram data disagreed with 
a frameshift. Frameshift editing was performed with TIGR Editor. 
5 The rRNA and other repeat regions precluded complete assembly of the 

circular genome with TIGR Assembler. Final assembly of the genome was 
accomplished using comb asm which splices together contigs based on short 
overlaps. 



Accuracy of the Genome Sequence 



10 The accuracy of the H. influenoze genome sequence is difficult to 

quantitate because there is very little previously determined H. influenoze 
sequence and most of these sequences are from other strains. There are, 
however, three parameters of accuracy that can be applied to the data. First, 
the number of apparent frameshifts in predicted H. influenoze genes, based on 

15 database similarities, is 148. Some of these apparent frameshifts may be in the 

da t abase sequences rather than in ours, particularly considering that 49 of the 
apparent frameshifts are based on matches to hypothetical proteins from other 
organisms. Second, there are 188 bases in the genome that remain as N 
ambiguities (1/9,735 bp). Combining these two types of "known" errors, we 

20 can calculate a maximum sequence accuracy of 99.98%. The average 

coverage is 6.5X and less than 1% of the genome is single-fold coverage. 



Identifying Genes 



An attempt was made to predict all of the coding regions of the H. 
influenzae Rd genome and identify genes, tRNAs and rRNAs, as well as other 
25 features of the DNA sequence (e.g., repeats, regulatory sites, replication 

origin sites, nucleotide composition). A description of some of the readily 
apparent sequence features is provided below. 
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The H. influenaze Rd genome is a circular chromosome of 1,830,121 
bp. The overall G/C nucleotide content is approximately 38% (A = 31%, C 
= 19%, G = 19%, T = 31%, IUB = 0.035%). The G/C content of the 
genome was examined with several window lengths to look for global 

5 structural features. With a window of 5,000 bp, the G/C content is relatively 

even except for 7 large G/C-rich regions and several A/T-rich regions (Fig. 
5). The G/C rich regions correspond to six rRNA operons and the location of 
a cryptic mu-like prophage. Genes for several proteins with similarity to 
proteins encoded by bacteriophage mu are located at approximately position 

10 1.56-L59 Mbp of the genome. This area of the genome has a markedly 

higher G/C content than average for H. influenaze ("50% G/C compared to 
"38% for the rest of the genome). No significance has yet been ascertained 
for the source or importance of the A/T rich regions. 

The minimal origin of replication (oriC) in E. coli is a 245 bp region 

15 defined by three copies of a thirteen base pair repeat containing a GATC core 

sequence at one end and four copies of a nine base pair repeat containing a 
TTAT core sequence at the other end. The GATC sites are methylation targets 
and control replication while the TTAT sites provide the binding sites for 
DnaA, the first step in the replication process (Genes V, B. Lewin Ed. (Oxford 

20 University Press, New York, 1994), chap. 18-19). An approximately 281 bp 

sequence (602,483 - 602,764) whose limits are defined by these same core 
sequences appears to define the origin of replication in H. influenaze Rd. 
These coordinates lie between sets of ribosomal operons rrnF, rrnE, rrnD and 
rrnA, mtB, rrnC. These two groups of ribosomal operons are transcribed in 

25 opposite directions and the placement of the origin is consistent with their 

polarity for transcription. Termination of £. coli replication is marked by two 
23 bp termination sequences located "100 kb on either side of the midway 
point at which the two replication forks meet. Two potential termination 
sequences sharing a 10 bp core sequence with the E. coli termination sequence 

30 were identified in H. influenaze at coordinates 1,375,949-1,375,958 and 

1 ,558,759-1 ,558,768. These two sets of coordinates are offset approximately 
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100 kb from the point 180° opposite of the proposed origin of H. influenaze 
replication. 

Six rRNA operons were identified. Each rRNA operon contains three 
rRNA subunits and a variable spacer region in the order: 16S subunit - spacer 
5 region - 23S subunit -5S subunit. The subunit lengths are 1539 bp, 2653 bp, 

and 116 bp, respectively. The G/C content of the three ribosomal subunits 
(50%) is higher than the genome as a whole. The G/C content of the spacer 
region (38%) is consistent with the remainder of the genome. The nucleotide 
sequence of the three rRNA subunits is 100% identical in all six ribosomal 

10 operons. The rRNA operons can be grouped into two classes based on the 

spacer region between the 16S and 23S sequences. The shorter of the two 
spacer regions is 478 bp in length (mifl, rm£, and rrnF) and contains the gene 
for tRNA Glu. The longer spacer is 723 bp in length (rrnA, rrnC, and rrnD) 
and contains the genes for tRNA He and tRNA Ala. The two sets of spacer 

15 regions are also 100% identical across each group of three operons. tRNA 

genes are also present at the 16S and 5S ends of two of the rRNA operons. 
The genes for tRNA Arg, tRNA His, and tRNA Pro are located at the 16S end 
of rrnE while the genes for tRNA Trp, and tRNA Asp are located at the 5S 
end of rrnA. 

20 The predicted coding regions of the H. influenaze genome were initially 

defined by evaluating their coding potential with the program Genemark 
(Borodovsky and Mclninch, Computers Chem. 17(2): 123 (1993)) using codon 
frequency matrices derived from 122 H. influenaze coding sequences in 
GenBank. The predicted coding region sequences (plus 300 bp of flanking 

25 sequence) were used in searches against a database of non-redundant bacterial 

proteins (NRBP) created specifically for the annotation. Redundancy was 
removed from NRBP at two stages. All DNA coding sequences were 
extracted from GenBank (release 85), and sequences from the same species 
were searched against each other. Sequences having >97% similarity over 

30 regions > 100 nucleotides were combined. In addition, the sequences were 

translated and used in protein comparisons with all sequences in Swiss-Prot 
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(release 30). Sequences belonging to the same species and having >98% 
similarity over 33 amino acids were combined. NRBP is composed of 2 1 ,445 
sequences extracted from 23,751 GenBank sequences and 11,183 Swiss-Prot 
sequences from 1,099 different species. 
5 A total of 1,749 predicted coding regions were identified. Searches of 

the H. infuenzxie predicted coding regions were performed using an algorithm 
that translates the query DNA sequence in the three plus-strand reading frames 
for searching against NRBP, identifies the protein sequences that match the 
query, and aligns the protein-protein matches using praze, a modified Smith- 
10 Waterman (Pearson and Lipman, Proc. Natl. Acad. ScL U.S.A. 35:2444 

(1988)) algorithm. In cases where insertion or deletions in the DNA sequence 
produced a frameshift error, the alignment algorithm started with protein 
regions of maximum similarity and extended the alignment to the same 
database match in alternative frames using the 300 bp flanking region. 
15 Regions known to contain frameswft errors were saved in the database and 

evaluated for possible correction. Unidentified predicted coding regions and 
the remaining intergenic sequences were searched against a dataset of all 
available peptide sequences from Swiss-Prot, PIR, and GenBank. 
Identification of operon structures will be facilitated by experimental 
20 determination of transcription promoter and termination sites. 

Each putatively identified //. influenaze gene was assigned to one of 
102 biological role categories adapted from Riley (Riley, M., Microbiology 
Reviews 57(4):S62 (1993)). Assignments were made by linking the protein 
sequence of the predicted coding regions with the Swiss-Prot sequences in the 
25 Riley database. Of the 1,749 predicted coding regions, 724 have no role 

assignment. Of these, no database match was found for 384, while 340 
matched "hypothetical proteins" in the database. Role assignments were made 
for 1,025 of the predicted coding regions. A compilation of all the predicted 
coding regions, their unique identifiers, a three letter gene identifier, percent 
30 identity, percent similarity, and amino acid match length are presented in 

Table 1(a). 
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An annotated complete genome map of H. influenaze Rd is presented 
in Figures 6(A)-(D). The map places each predicted coding region on the H. 
influenaze chromosome, indicates its direction of transcription and color codes 
its role assignment. Role assignments are also represented in Figure 5. 

A survey of the genes and their chromosomal organization in H. 
influenaze Rd make possible a description of the metabolic processes H. 
influenaze requires for survival as a free living organism, the nutritional 
requirements for its growth in the laboratory, and the characteristics which 
make it unique from other organisms specifically as it relates to its 
pathogenicity and virulence. The genome would be expected to have complete 
complements of certain classes of genes known to be essential for life. For 
example, there is a one-to-one correspondence of published £. coli ribosomal 
protein sequences to potential homologs in the H. influenaze database. 
Likewise, as shown in Table 1(a), an aminoacyl tRNA-synthetase is present 
in the genome for each amino acid. Finally, the location of tRNA genes was 
mapped onto the genome. There are 54 identified tRNA genes, including 
representatives of all 20 amino acids. 

In order to survive as a free living organism, H. influenaze must 
produce energy in the form of ATP via fermentation and/or electron transport. 
As a facultative anaerobe, H. influenaze Rd is known to ferment glucose, 
fructose, galactose, ribose, xylose and fucose (Dorocicz et ai , 7. BacterioL 
175:7142 (1993)). The genes identified in Table 1(a) indicate that transport 
systems are available for the uptake of these sugars via the 
phosphoenolpyruvate-phosphotransferase system (PTS), and via non-PTS 
mechanisms. Genes that specify the common phosphate-carriers Enzyme I and 
Hpr (ptsl and ptsH) of the PTS system were identified as well as the glucose 
specific err gene. The ptsH, ptsl, and err genes constitute the pts operon. We 
have not however identified the gene encoding membrane-bound glucose 
specific Enzyme II. The latter enzyme is required for transport of glucose by 
the PTS system. A complete PTS system for fructose was identified. 
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Genes encoding the complete glycolytic pathway and for the production 
of fermentative end products were identified. Growth utilizing anaerobic 
respiratory mechanisms were found by identifying genes encoding functional 
electron transport systems using inorganic electron acceptors such as nitrates, 
5 nitrites, and dimethylsulf oxide. Genes encoding three enzymes of the 

tricarboxylic acid (TCA) cycle appear to be absent from the genome. Citrate 
synthase, isocitrate dehydrogenase, and acordtase were not found by searching 
the predicted coding regions or by using the E. coli enzymes as peptide queries 
against the entire genome in translation. This provides an explanation for the 
10 very high level of glutamate (lg/L) which is required in defined culture media 

(Klein and Luginbuhl, 7. Gen. Microbiol. 775:409 (1979)). Glutamate can be 
directed into the TCA cycle via conversion to alpha-ketoglutarate by glutamate 
dehydrogenase. In the absence of a complete TCA cycle, glutamate 
presumably serves as the source of carbon for biosynthesis of amino acids 
15 using precursors which branch from the TCA cycle. Functional electron 

transport systems are available for the production of ATP using oxygen as a 
terminal electron acceptor. 

Previously unanswered questions regarding pathogenicity and virulence 
can be addressed by examining certain classes of genes such as adhesions and 
20 the lipooligosaccharide biogenesis genes. Moxon and co-workers (Weiser et 

aL, Cell 59:651 (1989)) have obtained evidence that a number of these 
virulence-related genes contain tandem tetramer repeats which undergo 
frequent addition and deletion of one or more repeat units during replication 
such that the reading frame of the gene is changed and its expression thereby 
25 altered. It is now possible, using the complete genome sequence, to locate all 

such tandem repeat tracts (Figure 5) and to begin to determine their roles in 
phase variation of such potential virulence genes. 

H. influenzae Rd possesses a highly efficient natural DNA 
transformation system (Kahn and Smith, 7. Membrane Biol. 138:155 (1984). 
30 A unique DNA uptake sequence site, 5' AAGTGCGGT, present in multiple 

copies in the genome, has been shown to be necessary for efficient DNA 
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uptake. It is now possible to locate all of these sites and completely describe 
their distribution with respect to genie and intergenic regions. Fifteen genes 
involved in transformation have already been described and sequenced 
(Redfield, R., J. BaaerioL 173:5612 (1991); Chandler, M., Proc. Natl. Acad. 
5 Sci. U.S.A 89:1616 (1992); Barouki and Smith, J. Bacterid. 163(2):629 

(1985); Tomb et al. 9 Gene 104:1 (1991); Tomb, J, Proc. Natl Acad. Sci. 
U.S.A 89:10252 (1992)). Six of the genes, comA to comF, comprise an 
operon which is under positive control by a 22-bp palindromic competence 
regulatory element (CRE) about one helix turn upstream of the promoter. The 
10 rec-2 transformation gene is also controlled by this element. It is now possible 

to locate additional copies of CRE in the genome and discover potential 
transformation genes under CRE control. In addition, it may now be possible 
to discover other global regulatory elements with an ease not previously 
possible. 

15 One well-described gene regulatory system in bacteria is the "two- 

component** system composed of a sensor molecule that detects some sort of 
environmental signal and a regulator molecule that is phosphorylated by the 
activated form of the sensor. The regulator protein is generally a transcription 
factor which, when activated by the sensor, turns on or off expression of a 

20 specific set of genes (for review, see Albright et al. , Ann. Rev. Genet. 25:31 1 

(1989); Parkinson and Kofoid, Ann. Rev. Genet. 26:71 (1992)). It has been 
estimated that E. coli harbors 40 sensor-regulator pairs (Albright et al. , Ann. 
Rev. Genet. 23:311 (1989); Parkinson and Kofoid, Ann. Rev. Genet. 26:11 
(1992)). The H. influenaze genome was searched with representative proteins 

25 from each family of sensor and regulator proteins using tblastn and tfasta. 

Four sensor and five regulator proteins were identified with similarity to 
proteins from other species (Table 6). There appears to be a corresponding 
sensor for each regulator protein except CpxR. Searches with the CpxA 
protein from E. coli identified three of the four sensors listed in Table 6, but 

30 no additional significant matches were found. It is possible that the level of 

sequence similarity is low enough to be undetectable with tfasta. No 
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representatives of the NtrC-class of regulators were found. This class of 
proteins interacts directly with the sigma-54 subunit of RNA polymerase, 
which is not present in H. influenaze. All of the regulator proteins fall into the 
OmpR subclass (Albright et al.,Ann. Rev. Genet. 25:311 (1989); Parkinson 
5 and Kofoid, Ann. Rev. Genet. 26:11 (1992)). The phoBR and basRS genes of 

H. influenaze are adjacent to one another and presumably form an operon. 
The nar and arc genes are not located adjacent to one another. 

Some of the most interesting questions that can be answered by a 
complete genome sequence relate to what genes or pathways are absent. The 
LO non-pathogenic H. influenaze Rd strain varies significantly from the pathogenic 

serotype b strains. Many of the differences between these two strains appear 
in factors affecting infectivity. For example, the eight genes which make up 
the fimbrial gene cluster (vanHam et al.. Mol. Microbiol. 13:673 (1994)) 
involved in adhesion of bacteria to host cells are now shown to be absent in the 
15 Rd strain. The pepN and purE genes which flank the fimbrial cluster in H. 

influenaze type b strains are adjacent to one another in the Rd strain (Fig. 7), 
suggesting that the entire fimbrial duster was excised. On a broader level, we 
determined which E. coli proteins are not in H. influenzae by taking advantage 
of a non-redundant set of protein coding genes from E. coli, namely the 
20 University of Wisconsin Genome Project contigs in GenBank: 1,216 predicted 

protein sequences from GenBank accessions D 10483, L10328, U00006, 
U00039, U 14003, and U 18997 (Yura et al. , Nucleic Acids Research 20:3305 
(1992); Burlandef al., Genomics 76:551 (1993)). The minimum threshold for 
matches was set so that even weak matches would be scored as positive, 
25 thereby giving a minimal estimate of the E. coli genes not present in H. 

influenaze. tblastn was used to search each of the E. coli proteins against the 
complete genome. All blast scores > 100 were considered matches. 
Altogether 627 E. coli proteins matched at least one region of the H. 
influenaze genome and 589 proteins did not. The 589 non-matching proteins 
30 were examined and found to contain a disproportionate number of hypothetical 

proteins from E. coli. Sixty-eight percent of the identified E. coli proteins 
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were matched by an H. influenaze sequence whereas only 38% of the 
hypothetical proteins were matched. Proteins are annotated as hypothetical 
based on a lack of matches with any other known protein ( Yura et aL , Nucleic 
Acids Research 20:3305 (1992); Burland et aL, Genomics 16:55 1 (1993)). At 
5 least two potential explanations can be offered for the over representation of 

hypothetical proteins among those without matches: some of the hypothetical 
proteins are not, in fact, translated (at least in the annotated frame), or these 
are E. co/z-specific proteins that are unlikely to be found in any species except 
those most closely related to E. coli, for example Salmonella typhimurium. 

10 A total of 384 predicted coding regions did not display significant 

similarity with a six-frame translation of GenBank release 87. These 
unidentified coding regions were compared to one another with fasta Several 
novel gene families were identified. For example, two predicted coding 
regions without database matches (HI0591, HI0852) share 75% identity over 

15 almost their entire lengths (139 and 143 amino acid residues respectively). 

Their similarity to each other but failure to match any protein available in the 
current databases suggest that they could represent a novel cellular function. 

Other types of analyses can be applied to the unidentified coding 
regions, including hydropathy analysis, which indicates the patterns of 

20 potential membrane-spanning domains that are often conserved between 

members of receptor and transporter gene families, even in the absence of 
significant amino acid identity. Five examples of unidentified predicted 
coding regions that display potential transmembrane domains with a periodic 
pattern that is characteristic of membrane-bound channel proteins are shown 

25 in Figure 8. Such information can be used to focus on specific aspects of 

cellular function that are affected by targeted deletion or mutation of these 
genes. 

Interest in the medically important aspects of H. influenaze biology has 
focused particularly on those genes which determine virulence characteristics 
30 of the organism. Recently, the catalase gene was characterized and sequenced 

as a possible virulence-related gene (Bishai et al. t 7. Bacterid. 7 76:2914 
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(1994)). A number of the genes responsible for the capsular polysaccharide 
have been mapped and sequenced (Kroll et al, Mol Microbiol 5f6J: 1549 
(1991)). Several outer membrane protein genes have been identified and 
sequenced (Langford et al. 9 J. Gen. Microbiol 753:155 (1992)). The 
5 lipooligosaccharide component of the outer membrane and the genes of its 

synthetic pathway are under intensive study (Weiser et al p J. Bacteriol 
775:3304 (1990)). While a vaccine is available, the study of outer membrane 
components is motivated to some extent by the need for improved vaccines. 

Data Availability 

10 The H. influenaze genome sequence has been deposited in the Genome 

Sequence DataBase (GSDB) with the accession number L42023. The 
nucleotide sequence and peptide translation of each predicted coding region 
with identified start and stop codons have also been accessioned by GSDB. 

Production of an Antibody to a Haemophilus influenzae Protein 

15 Substantially pure protein or polypeptide is isolated from the 

transfected or transformed cells using any one of the methods known in the art. 
The protein can also be produced in a recombinant prokaryotic expression 
system, such as E. coli, or can by chemically synthesized- Concentration of 
protein in the final preparation is adjusted, for example, by concentration on 

20 an Amicon filter device, to the level of a few micrograms/ml. Monoclonal or 

polyclonal antibody to the protein can then be prepared as follows: 

Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of any of the peptides identified and 
isolated as described can be prepared from murine hybridomas according to the 
25 classical method of Kohler, G. and Milstein, C, Nature 256:495 (1975) or 

-70- 

SUBSTITUTE SHEET (RULE 26) 



WO 96/33276 



PCT/US96/05320 



modifications of the methods thereof. Briefly, a mouse is repetitively 
inoculated with a few micrograms of the selected protein over a period of a 
few weeks. The mouse is then sacrificed, and the antibody producing cells of 
the spleen isolated. The spleen cells are fused by means of polyethylene glycol 
5 with mouse myeloma cells, and the excess un fused cells destroyed by growth 

of the system on selective media comprising aminopterin (HAT media). The 
successfully fused cells are diluted and aliquots of the dilution placed in wells 
of a microtiter plate where growth of the culture is continued. Antibody- 
producing clones are identified by detection of antibody in the supernatant 

10 fluid of the wells by immunoassay procedures, such as ELISA, as originally 

described by Engvall, E., Afeth. Enzymol. 70:419 (1980), and modified 
methods thereof. Selected positive clones can be expanded and their 
monoclonal antibody product harvested for use. Detailed procedures for 
monoclonal antibody production are described in Davis, L. et aL Basic 

15 Methods in Molecular Biology Elsevier, New York. Section 21-2 (1989). 

Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogenous epitopes of 
a single protein can be prepared by immunizing suitable animals with the 
expressed protein described above, which can be unmodified or modified to 

20 enhance immunogenics ty. Effective polyclonal antibody production is affected 

by many factors related both to the antigen and the host species. For example, 
small molecules tend to be less immunogenic than other and may require the 
use of carriers and adjuvant. Also, host animals vary in response to site of 
inoculations and dose, with both inadequate or excessive doses of antigen 

25 resulting in low titer antisera. Small doses (ng level) of antigen administered 

at multiple intradermal sites appears to be most reliable. An effective 
immunization protocol for rabbits can be found in Vaitukaitis, J. et aL , 7. 
Clin. Endocrinol Me tab, 53:988-991 (1971). 
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Booster injections can be given at regular intervals, and antiserum 
harvested when antibody titer thereof, as determined semi-quantitatively, for 
example, by double immunodiffusion in agar against known concentrations of 
the antigen, begins to fall. See, for example, Ouchterlony, O. et aL y Chap. 

5 19 in: Handbook of Experimental Immunology, Wier, D., ed, Blackwell 

(1973). Plateau concentration of antibody is usually in the range of 0. 1 to 0.2 
mg/ml of serum (about 12 ^M). Affinity of the an ti sera for the antigen is 
determined by preparing competitive binding curves, as described, for 
example, by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, second 

10 edition, Rose and Friedman, eds., Amer. Soc. For Microbiology, Washington, 

D.C. (1980). 

Antibody preparations prepared according to either protocol are useful 
in quantitative immunoassays which determine concentrations of antigen- 
bearing substances in biological samples; they are also used semi-quantitatively 
15 or qualitatively to identify the presence of antigen in a biological sample. 

Preparation of PCR Primers and Amplification ofDNA 

Various fragments of the Haemophilus influenzae Rd genome, such as 
those disclosed in Tables 1(a) and 2 can be used, in accordance with the 
present invention, to prepare PCR primers for a variety of uses. The PCR 
20 primers are preferably at least 15 bases, and more preferably at least 18 bases 

in length. When selecting a primer sequence, it is preferred that the primer 
pairs have approximately the same G/C ratio, so that melting temperatures are 
approximately the same. The PCR primers and amplified DNA of this 
Example find use in the Examples that follow. 

25 Gene expression from DNA Sequences Corresponding to ORFs 

A fragment of the Haemophilus influenzae Rd genome provided in 
Tables 1(a) or 2 is introduced into an expression vector using conventional 
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technology. (Techniques to transfer cloned sequences into expression vectors 
that direct protein translation in mammalian, yeast, insect or bacterial 
expression systems are well known in the art.) Commercially available vectors 
and expression systems are available from a variety of suppliers including 
5 Stratagene (La Jolla, California), Promega (Madison, Wisconsin), and 

Invitrogen (San Diego, California). If desired, to enhance expression and 
facilitate proper protein folding, the codon context and codon pairing of the 
sequence may be optimized for the particular expression organism, as 
explained by Hatfield et a/., U.S. Patent No. 5,082,767, incorporated herein 

10 by this reference. 

The following is provided as one exemplary method to generate 
polypeptide(s) from cloned ORFs of the Haemophilus genome fragment. 
Since the ORF lacks a poly A sequence because of the bacterial origin of the 
ORF, this sequence can be added to the construct by, for example, splicing out 

15 the poly A sequence from pSG5 (Stratagene) using Bgll and Sail restriction 

endonuclease enzymes and incorporating it into the mammalian expression 
vector pXTl (Stratagene) for use in eukaryotic expression systems. pXTl 
contains the LTRs and a portion of the gag gene from Moloney Murine 
Leukemia Virus. The position of the LTRs in the construct allow efficient 

20 stable transfection. The vector includes the Herpes Simplex thymidine kinase 

promoter and the selectable neomycin gene. The Haemophilus DNA is 
obtained by PCR from the bacterial vector using oligonucleotide primers 
complementary to the Haemophilus DNA and containing restriction 
endonuclease sequences for Pstl incorporated into the 5' primer and Bgtll at 

25 the 5' end of the corresponding Haemophilus DNA 3' primer, taking care to 

ensure that the Haemophilus DNA is positioned such that its followed with the 
poly A sequence. The purified fragment obtained from the resulting PCR 
reaction is digested with Pstl, blunt ended with an exonuclease, digested with 
BglU, purified and ligated to pXTl, now containing a poly A sequence and 

30 digested BglR. 
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The ligated product is transfected into mouse NTH 3T3 cells using 
Lipofectin (Life Technologies, Inc., Grand Island, New York) under 
conditions outlined in the product specification. Positive transfectants are 
selected after growing the transfected cells in 600 ug/ml G418 (Sigma, St. 
5 Louis, Missouri). The protein is preferably released into the supernatant. 

However if the protein has membrane binding domains, the protein may 
additionally be retained within the cell or expression may be restricted to the 
cell surface. 

Since it may be necessary to purify and locate the transfected product, 

10 synthetic 15-mer peptides synthesized from the predicted Haemophilus DNA 

sequence are injected into mice to generate antibody to the polypeptide 
encoded by the Haemophilus DNA. 

If antibody production is not possible, the Haemophilus DNA sequence 
is additionally incorporated into eukaryotic expression vectors and expressed 

15 as a chimeric with, for example, fi-globin. Antibody to fi-globin is used to 

purify the chimeric. Corresponding protease cleavage sites engineered 
between the fi-globin gene and the Haemophilus DNA are then used to separate 
the two polypeptide fragments from one another after translation. One useful 
expression vector for generating B-globin chimerics is pSG5 (Stratagene). This 

20 vector encodes rabbit B-globin. Intron n of the rabbit fi-globin gene facilitates 

splicing of the expressed transcript, and the polyadenylation signal 
incorporated into the construct increases the level of expression. These 
techniques as described are well known to those skilled in the art of molecular 
biology. Standard methods are published in methods texts such as Davis et al. 

25 and many of the methods are available from the technical assistance 

representatives from Stratagene, Life Technologies, Inc., or Promega. 
Polypeptide may additionally be produced from either construct using in vitro 
translation systems such as In vitro Express™ Translation Kit (Stratagene). 
While the present invention has been described in some detail for purposes 

30 of clarity and understanding, one skilled in the art will appreciate that various 
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changes in form and detail can be made without departing from the true scope 
of the invention. 

All patents, patent applications and publications referred to above are 
hereby incorporated by reference. 
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outer membrane protein P5 (ompA) 
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Hindlll restriction endonuclease (hindlllR) 
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(1) GENERAL INFORMATION: 

(i) APPLICANTS: Human Genome Sciences, Inc.; 

Johns Hopkins University 

(ii) TITLE OF INVENTION: The Nucleotide Sequence of 
the Haemophilus influenzae Rd Genome, Fragments 
Thereof, and Uses Thereof 

(iii) NUMBER OF SEQUENCES: 1 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sterne, Kessler, Goldstein & Fox, P.L.L.C. 

(B) STREET: 1100 New York Avenue, N.W. 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP : 20005-3934 

<v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3 1/2 inch diskette 

(B) COMPUTER: Dell Pentium 

(C) OPERATING SYSTEM: MS DOS v6.22 

(D) SOFTWARE : ASCII Text 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: (Not Yet Assigned) 

(B) FILING DATE: (Herewith) 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/476,102 

(B) FILING DATE: 07-JUN-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/487,429 

(B) FILING DATE: 07- JUN-1995 
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(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/426,767 

(B) FILING DATE: 21 -APR- 1995 



(2) INFORMATION FOR SEQ ID NO : 1 : 

( i ) S EQUENCE CHARACTERISTICS : 

(A) LENGTH: 1830121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GGCCGCGGTG CATCACAAAA CATCATTCCA TCTTCAACAG GTGCAGCGAA AGCAGTAGGT 6 0 

AAAGTATTAC CTGCATTAAA CGGTAAATTA ACTGGTATGG CTTTCCGTGT TCCAACGCCA 120 

AACGTATCTG TtGTTGATTT AACAGTTAAT CTTGAAAAAC CAGCTTCTTA TGATGCAATC 18 0 

AAACAAGCAA TCAAAGATGC AGCGGAAGGT AAAACGTTCA ATGGCGAATT AAAAGG CGT A 24 0 

TTAGGTTACA CTGAAGATGC TGTTGTTTCT ACTGACTTCA ACGGTTGTGC TTTAACTTCT 3 00 

GTATTTGATG CAGACGCTGG TATCGCATTA ACTGATTCTT TCGTTAAATT GGTATCTTGG 360 

TACGATAACG AAACGGGTTA CTCAAACAAA GTATTAGACT TAGTAGCTCA TATCTACAAC 420 

TACAAAGGCT AATTAAAACT TTGAAAAAAT TAACCGCTCT TCGGAGCGTT TTTTATTATC 4 80 

TAGAATTTAA TTTACGCTCT AAAAATGAAC AAGGGATCAC TAAAAATAAT TTAAAACAAT 54 0 

CAATATTCTT CTAGCTTTTA TTCCTATTTA AGATTATATT AGCGCACAAC TGTTCGCTCA 600 

ATGAAAATCA AAAATAGGGT TAATATGAAT CTCGATCTCC ATTTTGTTCA TCGTATTCAA 660 

CAACAAGCCA AAACTCGTAC AAATATGACC GCACTTCGCT ATAAAGAACA CGGCTTGTGG 72 0 

CGAGATATCT CTTGGAAAAA CTTTCAAGAG CAACTCAATC AACTTTCTCG AGCATTGCTT 78 0 

GCTCACAATA TTGACGTACA AGATAAAATC GCCATTTTTG CCCATAATAT GGAACGTTGG 84 0 

ACAATCGTTG ACATTGCGAC CTTACAAATT CGAGCAATCA CAGTGCCTAT TTACGCAACC 9 00 

AATACAGCCC AGCAAGCAGA ATTTATCCTA AATCACGCCG ATGTAAAAAT TCTCTTCGTC 96 0 

GGCGATCAAG AG C AATACG A TCAAACATTG GAAATTGCTC ATCATTGTCC AAAATTACAA 102 0 

AAAATTGTAG CAATGAAATC CACCATTCAA TTACAACAAG ATCCTCTTTC TTGCACTTGG 108 0 
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TTCAAACGCC CAACAAGATG AACTAACCCA ACGCTTAAAC 114 0 

ATTTACGATT ATTTATACCT CAGGCACAAC GGGAGAGCCT 12 00 

CGCTAATCTC GCTCACCAAT TAGAAACACA TGATCTTTCA 12 6 0 

TATTTCACTT TCTTTTTTAC CATTCTCTCA TATTTTTGAA 132 0 

TCTTCATAGA GGCGCAATAC TTTG CTATTT AGAAGACACT 13 8 0 

AACGGAAATT CGCCCAACTT TAATGTGCGC CGTACCACGT 144 0 

TGCCGTATTG GATAAAGTTC AAAAAGCCCC AAAACTTCGC 1500 

AATTTCCGTG GGACAAAAAT ATTTTGATCT ACGTGCGAAT 156 0 

ATTGAAGAAA CAATTTGCTT TAGCAGATAA ATTAGTGCTC 162 0 

GGGGGGACGA ATAAAAATGA TGCCTTGCGG AGGAGCAAAA 16 8 0 

ATTTTTCCAT GCTATTGGTA TCAACATCAA ATTAGGCTAT 174 0 

AACCGTTTCT TGCTGGCATG ATTTCCAATT TAACCCAAAT 1800 

AAAAGCGGAA GTGAAAATTG GGGAAAATAA TGAAATCCTT 186 0 

GAAAGGCTAT TACAAGAAGC CAGAAGAAAC GGCTCAAGCC 1920 

AAAAACTGGC GATGCAGGAG AATTTGACGA ACAAGGCAAT 1980 

CAAAGAATTA ATGAAAACCT CAAACGGCAA ATATATCGCA 2 04 0 

AATCGGTAAA GATAAATTTA TCGAACAAAT TGCGATCATC 210 0 

ATCCGCGCTT ATTGTGCCTT G CTTTGATAG TTTGGAAGAA 216 0 

TAAATATCAT GACCGTTTAG AACTACTAAA AAATTCTGAC 2 22 0 

TCGTATTAAT GCGGTGCAAA AAGAATTGGC TCATTTCGAG 2280 

ACTTTCTCAA GCATTCAGCA TTAAATTAGG CGAAATTACA 23 4 0 

AAAAGTGATT TTAGAACGTT ATCGCAAGCA AATTGAAGCA 24 0 0 

ATAGCTAAAA GGCTCTCAAT TGAGAGCCTT TCTTTTCTAA 246 0 

CTAAAATCAA TGCAAGTCCA TCTTCATTAT TGGTTGCCGT 2 52 0 

TAATTTCATT TGGCGCATTT CCCATTGCAA CACCAAGCCC 2 58 0 

GATCATTAAA ATTATCGCCA AATGCAATCA CTTCATTAGT 2 64 0 

CCAAAAAACG CACCGCACTT CCTTTGGTTG CACTCTTGTG 2 700 

CGTGGGAACG ACAAATACTT AAATGTGGAA ATTTTT CCTT 2 76 0 
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TAAAAGAACT TCAATCTCAA TGATTTCTTC TGCTTCCCCA ATAATTTGAA TTTTATGTGG 
CGAACGTGTC GCTACTTCAT CAAAAGGATG AATCTCAATT TTGGTCACGC TGCGTTCATA 
AATCACCCAT TTATTTTCTA CATCGCGCGC ATGGCAATCA TTATTTGTAT AATAATTCAC 
GCCAAGCAAA GGATGTTCCG CCAAAACAGT ATTAATCTCT AAAATATCTT TTGGCTCAAT 
TTGTACGCTA TAAATTGGTT CGAGATTTTG GTTCAAAATA AGCGCACCAC TAAATGCAAC 
AAGTACATTA TTCGTTTCAA GCTGTTTCCA ATAAGGCAAA ATACCTAAAG GAGAACGCGC 
CGAAATTGGC ACAAAAGGAA TGCCATTCGC CGTTAAACGC TTAATTACCA CAACAGTTCG 
AGGGGAAATT GTATGTTGAG AGGTTAATAA GGTGCCATTA AAATCACTAA ATACTG CTTT 
ATACATCATT ACTCCGAATA AAGATTATTT CTCTGCTAGA TAAGGATCAT CAAAGCCTAA 
TCCTTGCATA ATTTGGG TTT CTAAACTTTC CATTTCTTCC GCTTCATCGT CTTCAATATG 
ATCATAACCG AGTAAATGTA AGCTACCATG CACGACCATA TGCGCCCAAT GTGCCATTAA 
TGGTTTCTCT TGTTCTGAGG CTTCTCGCTC CACGACTTGA CGACAAATAA CGAGATCCCC 
TAACAATGGC AACTCCACCT CATCGGGGCA TTCAAATGGA AATGAAAGCA CGTTAGTCGG 
GCGATCTTTT CCACGATAAG TTAAATTTAA TTCATGGCTT TCGGCTTCAT CCACAATACG 
CACCGTCATT TCGACTTCAT TGCCCTCAGG CTGAACAGCA CCTGTTGCCC ACTGCACAAT 
CTGCTCTTCT GTTGGCAAAC CCTCTATATT TTCTGTGGCA ATTTGCAAAT CAACCAATAC 
ACTTCCCATT CTATTCTCCT AAATTCTTTT CAATTTCTAA C CGCACTTTT GCACTTATTG 
AATACATATA ACAAACGGTG GGCTTATGAC CACTTTTGTA TGTGTAAAAA TGAGGGGAAT 
TATACAATAA TAAACCTACC AATCCCCTAA ATTTAGATTA CATAATTTAT GTTTACCGCT 
AATTTACTCT ATTTTTAACG TACAAAGAGA CAAATCACGC TATTTCCTTA ATCAACACAA 
AACCTCAAAT CTCTCGCTCT TGGTTGAGGT TAATTGTGAA GTCGTAGATG TCCGCTATAA 
ATATTTCCTT TTCCTTCCCT AGCAAAACCA ATTAAAGTCA AATTATGTTT TTCTGCCATT 
GTTACTGCTA AATCCGTTGC GGCAGAAATC GTGACTAACA TTTCCACCCC GCAAGCCACT 
GTTTTTTGCA CCATTTCATA ACTCGCTCGA CTGGAGGCTA AAATAAAACC TCTAGGTTTA 
CCCGATTTTG CGTGCCAACC AAGCAATTTA TCTAATGCAA CGTGGCGACC CACATCTTCG 
TAAATAGCTA ACATACTTCC ATATAAATCG AAAAATGCAC AAGCATGGGT AGCACCAGTT 
TTACATCCAA GCAATTGATT TTTTTGAAGA TCAATAAGAC AACTATCTAG CAAATTTAAA 
TCAAATTGAA AAGTACAGTC TAATTTCGGA AAAG TTTTAT ATACTTGATT CAATTGTTCT 
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GTACCACAAA TACCACAACC AGTACGCCCC GTAAGATTAC GACGATGCTC TTTTAATGCC 
ATAAATTTAC GGCTGGAAAG TTCAATTTGC ACTTCAATAC CATTGCAAAC TTCTACAACA 
T CAAT AC CAT AAATATCTTC TGGTTTATTG ATAATTCCCT CTGTTAGAGA AAAGCCTAAA 
GSAAAATCCT CTAAATTGTT TGGGG AACAC ATCATAACTG CGTGAGAAAT GCCGTTATAT 
ACCAAAGAAA CTGGTATCTC AACAGCCAGT ATGTCCTCTT TTTGTTGCAA AAGTGCGGTC 
AATTTTnCTG TATTTTTTGT CTTTCTAAAA AAGTTGATAG TTTGTCGAAT CCAGCACTGC 
ATTTATTACC AGTCCATTAG TTAAATGTAA AAAGTCTGTT AAATTATTTT TATATTTATG 
ATCTAGATCA CAATATTTGT TGAAGCTAAT GTCATTTGAT TATGGATTTA TCGTTAAAAA 
ATCGTTACTC TAGCCATCCA ATAAATCAAC G TTTTTACGA TGATTTATAT ACTAGAGAGT 
ATTATTTTAA CGAAATCAAC ATAATCTAGC TACTAAACGA ATTTTATTAC TATCTATAAT 
ATCTCAACAA CTTGTTGTTG AAAGGAG TG A TTATGCAGGT CTCAAGAAGA AAATT CTTC A 
AGATCTGTGC AGGAGGTATG GCGGGAACGT CAGCTGCAAT GTTGGGCTTT GCTCCAGCAA 
ACGTATTAGC TGCGCCACGC GAATATAAAT TATTACGCGC GTTTGAATCC CGTAACACCT 
GTACATATTG CGCTGTAAGT TGCGGTATGT TGTTATATAG CACAGGCAAA CCTTACAATT 
CATTAAGCAG CCATACTGGC ACAAATACTC GTTCAAAACT CTTTCATATT GAGGGTGATC 
CAGATCATCC AGTCAGTCGT GGTGCGCTTT GCCCGAAAGG TGCTGGCTCA CTCGATTATG 
TCAATAGTGA AAGCCGTTCT TTATATCCTC AATATCGTGC GCCAGGTTCT GATAAATGGG 
AACGAATTTC TTGGAAAGAT GCCATTAAAC GTATTG CTCG TTTAATGAAA GATGACCGAG 
ATGCCAACTT TGTTGAAAAA GATTCAAATG GAAAAACGGT TAATCGTTGG GCAACGACAG 
GAATTATGAC TG CAT C AG CA ATGAGCAATG AAGCTGCGTT ATTAACACAA AAGTGGATTA 
GAATG CTCGG TATGGTGCCA GTATGTAACC AAGCGAATAC TTGACACGGA CCAACGGTAG 
CAAGTCTTGC TC CATCATTT GGTCGCGGTG C CATG AC AAA TAACTGGGTT GATATTAAAA 
ATGCCAACTT AATCATCGTT CAAGGCGGTA ACCCTGCAGA AGCCCATCCT GTTGGCTTCC 
GTTGGGCAAT TGAAGCGAAG AAAAACGGTG CGAAAATCAT CGTTATTGAT CCGCGTTTTA 
ACCGTACAGC ATCCGTTGCT GATCTTCATG CGCCAATTCG TTCTGGTTCT GATATTACGT 
TCTTAATGGG CGTGATCCGT TACCTATTGG AAACAAACCA AATTCAACAC GAATATGTTA 
AACACTATAC CAACGCATCA TTCTTAATTG ATGAAGGTTT CAAATTTGAA GATGGTTTAT 
TTGTAGGGTA TAACGAAGAA AAACGTAACT AC G AT AAATC TAAATGGAAC T AC C AATTTG 
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ATGAAAATGG TCACGCTAAA CGTGATATGA CATTACAACA TCCTCGTTGT GTCATTAACA 
TCTTAAAAGA GCACGTTTCT CGTTATACCC CAGAAATGGT TGAACGTATT ACAGGCGTAA 
AACAAAAACT CTTCTTACAA ATCTGTGAAG AAATTGGTAA AACCTCTGTG CCAAATAAAA 
CG ATGACG C A TCTATATGCA TTAGGTTTTA CAGAGCATTC AATCGGTACA CAAAATATTC 
GCTCAATGGC GATAATCCAG TTACTTTTAG GTAATATGGG GATGCCAGGT GGCGGTATTA 
ACGCATTACG TGGACACTCC AATGTGCAAG GTACGACAGA TATGGGCTTA TTGCCAATGT 
CTTTACCAGG TTATATGCGT TTGCCAAACG ATAAAGATAC CTCTTACGAT CAATACATTA 
ACGCAATTAC ACCAAAAGAT ATCGTTCCAA ACCAAGTGAA CTATTATCGT CATACTTCAA 
AATTCTTTGT TAGCATGATG AAAACTTTCT ACGGAGATAA TGCCACTAAG GAAAATGGCT 
GGGGATTCGA TTTCTTACCA AAAGCAGATC GCCTATATGA wCCAATTACT CACGTTAAAT 
TGATGAATGA AGGCAAATTA CACGGTTGGA TTTTACAAGG TTTTAACGTA TTAAATTCAC 
TACCAAATAA AAATAAAACG TTATCTGGTA TGAGTAAACT GAAATACTTA GTCGTTATGG 
ATCCATTACA AACTGAATCA TCAGAGTTTT GGAGAAATTT TGGTGAGTCA AATAATGTAA 
ATCCTGCAGA AATTCAAACA GAAGTTTTCC GTTTACCAAC TACTTGTTTC GCAGAAGAAG 
AAGGATCAAT CGTTAATTCT GGTCGCTGGA CTCAATGGCA CTGGAAAGGT TGCGATCAAC 
CGGGAGAAGC CTTACCTGAT GTTGATATTC TTTCTATGCT ACG CGAAGAA ATGCACGAAC 
TTTATAAAAA AGAGGGTGGA CAAGGAATTG AATCTTTTGA AGCGATGACT TGGAATTATG 
CTCAACCACA CTCACCAAGT GCGGTTGAAT TAGCCAAAGA ATTAAATGGT TATGCGCTTG 
AAGATCTTTA TGAtCCAAAC GGTAACTTGA TGTACAAGAA AGGTCAATTA CTCAATGGAT 
TTGCACATTT ACGTGATGAT GGTACAACAA CATCAGGTAA CTGGTTATAT GTTGGTCAAT 
GGACTGAAAA AGGCAACCAA ACTGCTAATC G CG ATAATTC AGATCCATCG GGTTTAGGTT 
GTACTATTGG CTGGGGCTTT GCATGGCCTG CAAACCGCCG CGTACTTTAT AGCCGTGCAT 
CATTAGATAT CAATGGTAAT CCTTGGGATA AAAACCGCCA ATTAATCAAA TGGAACGGTA 
AAAACTGGAA CTGGTTTGAT ATTGCTGACT ACGGTACGCA ACCACCAGGT TCTGATACTG 
GGCCGTTCAT TATGTCCGCA GAAGGCGTAG GACGTTTATT TGCCGTTGAT AAAATTGCAA 
ATGGCCCAAT G C C AG AAC AC TATGAACCAG TTGAAAGCCC AATTGATACA AACCCATTTC 
ATCCAAATGT AGTAACCGAT C CAACTTT AC GTATCTATAA AGAAGATCGT GAATTTATTG 
GTTCAAATAA AGAGTATCCA TTTGTAGCAA CAACTTATCG TTTAACCGAG CATTTCCACA 
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GCTGGACTGC ACAATCTGCA TTAAATATCA TCGCACAACC ACAACAATTT GTGGAAATTG 
GCGAAAAATT AGCGGCAGAA AAAGGCATCC AAAAAGGCGA TATGGTAAAA ATTACTTCTC 
GTCGTGGCTA TATTAAAGCG GTCGCCGTGG TTACAAAACG TCTTAAAGAT CTCGAAATTG 
ATGGGCGTGT CGTACACCAT ATAGGTCTTC CAATTCACTG GAATATGAAG GCATTAAATG 
GCAAAGGTAA CCGTGGATTC TCTACGAATA CCTTAACACC ATCTTGGGGT GAGGCAATCA 
CGCAAACACC AGAATACAAA ACATTCTTGG TAAATATTGA AAAAGTTGGG GAGGCATAAT 
ATGGCTGGAA CTGCTCAAGG CGTTCAAACG CAAGACGTTA TTAAAATCTC CGCAACATCT 
GGTTTAACAC CAGCACCACA AGCGAGAGAT CATAAAGTTG AAATCGCAAA ATTAATTGAT 
GTTTCAACTT GTATAGGATG TAAAGCCTGT CAAGTGGGTT GTTCAGAGTG GAATGATATT 
CGCTCTGATA TCAATGCACA ATGTGTTGGG GTTTATGACA ATCCAGTTGA TCTTGATGCA 
AAAGCATGGA CAGTGATGCG TTTTAACGAA GTTGAAGAAA ACGATCGTTT AGAATGGCTG 
ATTCGTAAAG ATGGCTGTAT GCACTGCACA GAACCTGGTT GTTTAAAGGC TTGTCCTGCA 
CCCGGTGCAA TCATTCAATA TGCTAACGGT ATTGTAGATT TCCAATCCGA TAAATGTATT 
GGTTGTGGTT ATTGTATTGC AGGCTGTCCA TTCAATATTC CACGAATGAA TCCTGAAgAC 
AATCGTGTAT ACAAATGTAC TCTTTGTGTA GATCGTGTTT CTGTGGGACA AGAGCCCGCT 
TGCGTGAAAA CCTGTCCAAC AGGTGCTATT CGTTTCGGCT CTAAAGAAGA AATGAAAATT 
TACGCAGAAC AACGCGTGGC AGATTTAAAA TCCCGTGGTT ATGAAAACGC AGGACTTTAC 
GATCCTCCAG GTGTAGGCGG TACACACGTA ATGTATGTkT TGCATCACGC AGATAAACCT 
GAGCTTTACA ATGGTCTTCC AAAAGATCCT CAAATTGATT TGAGTGTGAC CTTATGGAAA 
GATGTATTGA AACCAGTCGC TGCAGTTGCA ATGGGTGGTC TAGCTCTAGC AGAAGTTGCG 
CACTACTTAA CCGTTGGCCC AAATGTAGAA GAAGATGTGG AAGATCATCA TCACGAATTT 
GAAGAAAATA AACCATCTAA GGGGGAAAAT AATG AG T AAA ATTGAAATTA GCAACGATAC 
TCGCATTATC CGCCATAGAA CCCCAGCACG TATCAGTCAC TGGATGTTGG TTATTTG CTT 
TTTTATGACG ATGTTCACTG GTGTTGCATT TTTCTTCCCT GACTTTGCTT GGCTCACAGA 
AATTTTGGGT ACACCACAAA TTGCGCGCGC CATTCATCCA TTCACAGGGA TTTTAATGTT 
CTTCGCTTTC ATCTACTTAG CGTTATTGTA CTGGGATCAC AATATT C C AG AAAAAAACGA 
TATTCGCTGG GCGAAAGGTG TTAT^AAGT TCTTAAAGGG AATGAACACG CGGTTGCTGA 
TAATGGCAAA TATAACCTTG GTCAAAAAAT GCTCTTTTGG ACATTAAACT TGGCAATGGT 
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AACGTTATTA GTCACTGGCA TCATTATGTG GCGTCAATAT TTTTCACATT ACTTCTCAAT 9 54 0 

CCCAGTATTG CGAATTGCTA TTTTACTCCA CTCTGCAAGT GCTTTTATGT TATTCACTGG 96 00 

TATCTTAGTG CATATATATA TGGCATTTTG GGTTAAAGGA TCAATTCGCG GTATTGTTGA 966 0 

AGGTTGGGTA ACCGTTCGTT GGGCGAAAAA ACACCACCCA AG ATGGTAT C GTGAAGAAGT 972 0 

TTTATCAAAA CTTGAGGAAG ATTTACTCAA CGAGCAATCT GGCAAAGTAG GTAAAACCAA 978 0 

AGTCTTATTT AAAGGATTTG GCAAATAAGC TGAAATCCTG AAAAAGAAAA AGTGCGGTTA 984 0 

AAATTGACCG CACt TTTTAT TATAAAAGTG AGAGAATATG AGTATCAAAA TCTTATCTGA 990 0 

ATCAGAAATT AAACAAGTAG nAAATTCATA TCAAGCCCCA GCGGTTTTAT TTGCCAATCC 9 96 0 

TaAAAATCTT TACCAACGCA GAGCGAAACG TTTAAGAGAC TT AG CAC AAA ATCATCCTCT 10 020 

ATCTGATTAT TTATTATTTG CTGCAGACAT AGTTGAAAGC CAACTTTCCA CGTTAGAAAA 1008 0 

AAATCCTTTA CCGCCACAAC AG CTTG AAC A GTTAAATACT ATCGAGCCAC TAAATG CCAA 1014 0 

AACCTTTAAn AGAAACAGTA TCTGGCGTGA ATACTTAACA GAAATTCTTG ATGAAATAAA 1020 0 

GCCCAAAGCT AACG AG C AAA TTGCTGCAAC AATTGAATTT CTTGAAAAAG CCTCTTCCGC 10260 

TGAATTAGAA GAAATGGCAA ATAArCkCTT AGCACAAGAA TTTAACTTAG TC AG CAGTG A 10320 

TAAAGCCGTC TTTATTTGGG CTGCACTTTC CCTTTATTGG TTACAAGCAG CTCAACAAAT 10380 

TCCTCATAAT AGCCAAGTTG AAAACGCTGA AAATTTACAT CACTGCCCTG TTTGTGGTTC 10440 

TTTACCTGTG GCAAGTATGG TACAAATTGG TACATCACAA GGTTTACGCT ACTTACATTG 10 500 

TAATTTATGT GAAAGTGAAT GGAATTTGGT ACGCGCACAA TGCACCAATT GTAATAGTCA 10 560 

TGACAAACTC GAAATGTGGT CACTAAATGA AGAACTTGCG CTTGTTCGTG CCGAAACCTG 10620 

TGGTAGTTGT GAAAGTTACT TGAAAATGAT GTTCCAAGAA AAAGATC CTT ACGTAGAACC 10680 

TGTAGCCGAT GATTTGGCTT CTATTTTCTT AGATATTGAA ATGGAAGAAA AAGGTTTCGC 10740 

CCGAAGTGGA TTAAATCCAT TTATTTTTCC TGCAGAAGAA GCATAAAAAT AT AG C CTAGA 10800 

AATATCTAGG CTAATTATTT AAATCTATAG ATAACACGCC ATTACCACTG CATTTTCTCG 10860 

TCCACCACTG GGTTTCGGAT AATAATTTTT CCGAATATCC ACTTCATTAA AACCAATTTT 10 920 

TTCATACAAG AAACGAG C AG AGTTAGACTC TCTGACTTCT AGCCATAAGG TTTGG AC AC C 10980 

TTTTTCCTTT AATTGAAAGA TTAATTTTCC TAACAATAAT TTGCCAAATC C ACAAC CTTG 11040 

ATAAGTAGGC AAAATCGCAA TATTAAACAA AGTCG CTTCA TCCAATACTG TTTGGCAAAT 1110 0 

AGCAAAACCG ATAATCTGAT TATTCTCTAT TAATTTTAAA TTGAGATAAC GCTCCCCTTG 1116 0 
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ATTATTTTTT AACGTACCAA ATGACCAAGG CAC CAGATGG GCTTGCTGTT CGATTTCATA 1122 0 

CAATCGCTCG AAATCACAGG CTTCAATTTG AGAAATAATA GACATAATTA AGGCTGCTGA 1128 0 

ATTTGTTGCC ACAACGCTCG TTTGGCTTGA TGATTAGATT GAAATTGCTG CCAACTTGGC 11340 

GAGCGATAAA CCTGCTCAGC CTGCTTGCAA AATGGCAAAG TGCGGTCAAT TTGGTCGCTA 11400 

TTTTCTGATA GTAACCAATA ACGAATAGGC TGTTTACATT CCATATGCTG GATTTGATCG 11460 

TAATTCAAAC ATAAACAATT TTCTTTTTTA AGATTAAGGC TTAACAGCAC ATCAGCCAAC 11520 

AAAGGCGAGC TACTGATATT TTCATCGGAA ACAGTGATAA GGCGAATATT CTCTGCCACA 1158 0 

CTAATTCCTA CTGAACCTTG CAGTACCTCG GGG CGATAT A ATTCCCACTG GGAAATGCCC 1164 0 

ATTTCTTG T A AAAGAAGATC GCGTCTGTTC ATAAGTGAAT TTTTAAAAAC GTTGCTTGAA 11700 

AAATAGTGTT AATTTCTTTT ATTAATCTTT GCTAAAATGG TGCGTAATTT TAACCAATAA 1176 0 

CCCACAGGAT TCAAAGCGTG ATTTCTCTAG AAAGCCAAGT TTTAGAACGC CATATTTCAT 1182 0 

TTTTTGATGG TAAATCGGTA TTATTTGCTG GCGGTATTTC TGATAGTTTT CCTCAAACAT 1188 0 

TAGCCTCAAA ATGCTCATCA ATC CAAATTT GGAGTTGCTA TTTTGACTAT GCAAGAACAC 1194 0 

AAAGTGCGGT CAATTTTTCC GTTGAATTTC AAGGTCAAGC GGATTTAATC GTTTATTATT 12 000 

GG AC TAAAAA CAAACAAGAA GTCAATTTCC AACTGATTCA ACTATTAGCT CAAGCATCTA 12 060 

TTGGTCAAGA AATATTAATT ATTGGTGAAA ACCGTTGTGG CGTGCGTTCA GTAGaAAAAA 12120 

CATTATCACC CTATGGAGAA AT AG C AAAAA TTGACTCTGC TCGTCGTTGC GGTTTATATC 1218 0 

ATTTTTCACT CCAAAATAAA CCGCACTTTG AACTT AAAAA TTTCTGGAGA ACCTACCAAC 12240 

ATTCAACACT TGAAAACTTA ACTATATATA GTTTACCTGG GGTATTTAGT GCGGCGGAAT 12 3 00 

TAGACACAGG CACAGAACTG TTACTTTCAA CTATTGATAA TAAAATAAAA GGAAAAGTGC 123 6 0 

TTGATCTTGG CTGTGGGGCG GGGGTAATTG GCTCTATGAT AAAAAAACGC ACGCCTAACG 124 2 0 

CACAAATTAC AATGACAGAT ATTCACGCAA TGG CATTAGA ATCAGCGCGT AAAACACTTT 124 80 

CTGAAAATCA ATTACAAGGC GAGGTTTACG CCAGTGATGT CTTTTCTGAT ATAGAAGGAA 12 54 0 

AATTTGATTT GwTCATTTCT AAT C CTC CAT TCCACGATGG CATTGATACC GCCTATCGAG 12 6 00 

CAGTGACAGA ACTCATCACT CAAGCGAAAT GGCATCTCAA TCAAGGTGGA GAATTACGCA 12 66 0 

TTGTCGCCAA TTCATTTCTG CCTTATCCCG AATT AC TTAG ACAACATTTC AATGATTATC 12 72 0 

AAGTTCTTGC TCAAACAGGA AAATTTAAAG TGTATTCAGT GAAAAATTAG AAACAAAAAA 12 780 

ATCCCCATCA CAATAATGGG GATTTTTATT ATGCGTATAA ATTTTACCGC ATTTT ATT C A 12 84 0 
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TCCATATATC CTAAACTACG CAGCGCACGT TCATCATCAG CCCAACCAGA TTTCACTTTT 12 900 

ACCCAAAGTT CAAGATGGAC TTTGTTATCA AATAAACGTT CCATATCGGC TCTAGCTTCC 12 96 0 

ATACCAATAG TTTTAATTTT TTGACCGCCC GCACCAATGA CCATTTTTTT CTGAC CTTCA 13 020 

CGTTCAACGA GAATTAAGCC ATTTATTTCA TAAGTGCCAC GTTCATTTAC TTTGAATTGT 13080 

TCGATTTCAA CAGTAACAGA ATAAGGCAAT TCTTCACCGG TAAAACGCAT TAATTTTTCA 1314 0 

CGGATTATTT CCGAGGCCAT AAAACGTTGC GAACGATCTG TGACATAATC TTCTGGAAAA 132 00 

TGATGCACAC CTTCACGTAA AGATTGGCGT ACAATTTTCT CTAATTCATG AACATTATTA 13260 

CCACGCTGTG CAGAAATTGG CACAATGTGA GCAAAGTTAA ATTTACTGCT AAGATCTGTA 13320 

ATAAACGGCA AC AAATC AT C TTTATTCTTG ATGTTATCTA CTTTGTTGAT AGCAAGCACA 133 80 

ACGGGTGCTT TTGCATTACG TAGTTTGTTT AGCACCATCT CATCATCTGC ATTCCAGTGT 1344 0 

GTACCATCTA CGACAAAAAT AATTAAATCA ACATCGCCGA TAGCACTACT TGCGGCGCGG 13 5 00 

TTCATTAAGC GGTTAATCGC CCGTTTTTCT TCAATATGAA GCCCCGGTGT ATCCACGTAG 13 56 0 

ATTTCTTGAT ACGCACCTTC AGTTTTAATC CCAACAATGC GGTGACGGGT AGTTTGTGCT 13620 

TTACGCGATG TAATTGAAAT TTTTTGTCCT AAAATTTTAT TTAAGAGCGT TGATTTACCC 13680 

ACATTTGGGC GACCTACGAT AGCAATAAAG CCGCAATAGG TTTTGTCGAA TTGTTCGGTC 13 740 

ATTTGATGTC CAGTTCTTTT AAAATTTGTT CTGCTGCCGC TTGTTCAGCT TTACGACGGC 13800 

TAGATCCTTT TGCCACAAAA GTGCGGTCAA TTTTCTCCGC ACTTTTCACT TTGCATTTAA 13 860 

CAGTGAAAAT TTGGCAATGG GCTTCCCCTT GAATATTGAC CACTTCATAA GTTGGCAATG 13 920 

G C AAATGTTT GCCTTGTAAA TATTCTTGTA AACGCGTTTT CGCATCTTTT TGATTATCGC 13 98 0 

CTGGTTTAAT TTCTGCCAAT AACTGTTGAT ACCAATTTCG TATCACTTGG GTCGTCACAG 14 040 

CCAATCCTTG ATCCAAAGAC ATTG C AC CT A TGATCGCTTC TACACAATCG GCAAGAATAG 14100 

ATTCACGGCG AAAACCACCG TTCTTTAGCT CGCCAGAACC AAGCGACATA TAATCGCCTA 1416 0 

ATT CAAATTG GCGCGCTAAA ATCGCTAAAG TGGGTTCGCG TACCAAAGTA GCACGCATAC 14 22 0 

GGCTCAGTTC TCCTTCATTA CAACGTGGAA ACTGATGATA AAGTG CTTCA GCGATAGTGA 14 280 

AATTAAGAAT GGAATCGCCT AGGAATTCTA GGCGTTCATT ATGTTGAGTT GCCGCACTTC 14 34 0 

GATGCGTAAG TGCCTGCTTT AAGAGCGCTA TATCATTAAA ACGATAGCCG ATTTTACGCT 14400 

CAAGTCGATC TAAATGATTC ATCTTATTTT ATTGCTGTAA AGAAACGCTC AAAACGGAAA 1446 0 

CCTGTTGGCC ATTCATTCGC TTCTTTTTCT AAGCTCATCC AAATATAAGT GGCTTTACCC 14 52 0 

-77.10- 

SUBSTTTUTE SHEET (RULE 26) 

BNSDOCID: <WO_9633276A1J_> 



WO 96/33276 PCT/US96/05320 



ACAATATTTT 


TTTCAGGTAC 


AAAGCCCCAG 


AAACGACTGT 


CATCGCTGTG 


AT C G C G ATG A 




TCCCCCATCA 


CAAAATACTG 


CCCCTCTGGC 


ACAAGCCATT 


CTGCAGTTTG 


CATTCCCTCT 




TGTGGGAAAA 


ATTCAAGGTC 


TGAATAACGA 


CGATACTCAC 


TAATTAACAC 


G TT ATG TG TT 


i 4 "7 n n 


ACATCGCCTT 


TTTCAGTCAA 


TTCTAATTCA 


TTTGGAAAAG 


CAGGATTTGT 


TGGATTTTGT 

* >J >J«t. X X X X V3 X 


X *k / O U 


GTATATTCAA 


ACGCCTTAGT 


TTCGCAATCA 


ATTTCACAGG 


GTTTCCCCTC 




1 4 Q *5 n 
X *± O Z U 


ACCACTTTTA 


GCGTTTTTTG 


TTCCACATCA 


AAAATGACGC 


GATCTCCGCC 


CT TT C CAAPA 


1 ADDA 

X ** o o u 


ATACGCTTAA 


TATAATCCAC 


ACCAGACATA 


TTATCTTTTG 


AACTTAACGC 


TAAA.TTTTCT 


1 4 Q4 ft 
Xt 74 U 


GCAAAAGCCG 


CTCGTGTTGC 


CCCAAGACCA 


GTACGAATTA 


ACGCTTGTTG 


TGGTGCTTTA 


15000 


AACACAATCA 


CATCGCCACG 


TTGTGGTTTT 


TCGCCTGCAA 


TAATGGTGTT 


TTGGAAAATC 


1 506 0 


GGATCTTTCA 


CACCATAAGC 


ATATTTATTC 


ACAACTAAAA 


AATCGCCAAC 


GCGTAAAGTG 


■x. 3 X 4. v 


GACTCCATTG 


AGCCAGAGGG 


AATTTGAAAC 


GGTT CAAAC A 


AAAAAGAACG 


AACCAAAAAT 


i ci on 

X J X. o \J 


ACCACTGCAA 


GCACAGGAAA 


AAGTGAAGAC 


AAAAATTCTG 


AAGCCTCAGA 


AATCGGTTCA 


1 jatU 


ATTTTGGCTT 


TTTCTTCCTC 


ACTTAAGGTT 


TTACCAGAAC 


GTTGTTCTGC 


ACGTGCCACT 


x, -J j w \j 


TGACGATGAC 


GTTTTGGCAG 


C AC C AC AAAA 


CGATGATAAC 


ACCATAATAC 


GCCAGAAAfiT 


X D .5 O U 


GCGGTCAAAA 


TTAGTAACAA 


AATACTAAAA 


GTATTTGGTA 


ACTGAAAATA 


ATCTAAAAPT 


1 Q4 o n 


TTCCACACAC 


CAAAGCCG AC 


AGCCAATAAA 


ATCACAAAAA 


ATAAATTTGA 


CATAATTTTf 
V— r\-X>vrlX ill*. 




CTTATTTATC 


TTTTCCTACG 


TGTAAAATTG 


CTAAAAACGC 


CTCTTGCGGC 


a f ,p m*r , T*2i f" , G'T 

/^V— X X \_ X X 


i ccAn 


TAC CTAAAGA 


CTTCATACGT 


TTTTTCCCTT 


CTTTCTGCTT 


CTGTAAGAGT 


TTTTTPTT A P 
x xxx i \— u. xn\v 


i t;cnn 

13QUU 


GGCTCACGTC 


ACCACCATAA 


CATTTTGCTA 


ATACGTTTTT 


ACG TAATTG T 


TT CAC C G TAG 


-) c c £i n 

X O O O *J 


AACGAGCAAT 


AATATGGTTT 


CCAATCGCTG 


CTTGAATAGC 


GATATCAAAT 


TGCTGACGAG 


15720 


GAATTAACTC 


ACGCATTTTT 


TCAACCAATT 


CACGACCACG 


AT ATTG TG AA 


TTATPTTTAT 

A X *»• X ^ X X X X 


1 CTRO 
J. ^ f O \J 


GAACGATTAA 


AGCCAACGCA 


TCTACTCGTT 


CGCTGTTAAT 


CATAATATCC 


ACGCGAACCA 


15 84 0 


TATCTGCCGC 


TTGGAAACGT 


TTAAAACCGT 


AATCTAACGA 


AGCATAACCA 


CGCGATGTTG 


15 900 
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CACCATTGGT 


CATTTCAACT 


TCATAAATTA 


CAGTTGGTGC 


TGTGGTAATC 


AAATCAAGAT 


16200 
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CGTATTCACG CTCTAGACGC TCTTGAATGA TTTCCATATG CAGAAGTCCA AGGAAACCAC 1626 0 

AACGGAAACC AAAACCAAGC GCGGTCGAAG TTTCTGGTTC ATAGAAAAGT GACGCATCGT 16 32 0 

TTAAACTTAA TTT AC CAAGT GCATCGCGAA ACGCTTCATA ATCATCAGAA CTTACAGGGA 16380 

AAAGCCCAGC ATAAACCTGT GGTTTTACTT TTTTAAAGCC TGGTAAAACC TCAGTTGCAG 1644 0 

AATTATGTTG ATGGGTTAAG GTATCGCCCA CTGGCGCACC TAAAATATCT TTAATCGCAC 16 500 

AAACAACCCA ACCCACTTCG CCGCATTCTA AAACGGTAGT ATCTTCTTGT TTTGGCGTGA 16 56 0 

AAATCCCTAA ACGATCTACG TTATAAGTTT GCCCTGTGGA CATCACTTTA ATTTT ATCG C 1662 0 

CTTTACGTAA TACCCCGTTT TTAATACGCA CTAAAGATAC CACGCCCAAA TAATTATCGA 166 8 0 

ACCAAGAATC AATAATCAAT G CTTG TAATG GCGCATTAGG ATCACCTTCT GGTGCAGGAA 16 74 0 

TTTTTGCCAC GATTTCTTCT AATACATCTT CGATACCTAC ACCCGTTTTA GCTGAGCATC 16 8 00 

GAACCGCTTC CATCGCATCA ATTCCAACAA TATCTTCAAT TTCTTCAGCG ACGCGTTCAG 16 860 

GATCAGCAGC TGGTAAGTCG ATTTTATTTA AAATCGGTAC TACTTCTAAA TCCATTTCGA 16 92 0 

TTGCAGTATA ACAGTTTGCT AGAGTTTGGG CTTCTACCCC TTGACCAGCA TCAACTACCA 16 980 

ACAATGCCCC TTCACAAGCA GCAAGAGAGC GAGACACTTC ATAGGAAAAG TCCACGTGTC 17 040 

CTGGTGTATC GATAAAGTTT AATTGATAAG TTTCGCCATC TTTGGCTTTA TAATTTAAAG 17100 

TAACACTTTG TGCTTTGATG GTAATTCCAC GCTCACGCTC AAGATCCATT GAATCTAGCA 1716 0 

CTTGCGCTTC CATTTCACGA TCCGAAAGGC CACCACAAGT TTGAATCAAA CGGTCAGAAA 1722 0 

GTGTCGATTT ACCGTGGTCA ATATGGGCAA TAATAGAAAA GTTGCGAATA TTCTTCATAA 17280 

AAGATTGTTT TTCTCAAAAA TCATTAAATT TAACTGGCGG ATTGTACCTG AAAAGCGCGG 173 4 0 

AAATCGCTAG GGATGAAAGG GAAAAGTGCG GTCAAATTTT CAAACTTTTT AGATAAAACA 17400 

AAACCCCGAA TAACTTCGGG GTTTCTTTTT TCATTCAAAC TGAAATTATA GGCTTTCAGT 17460 

GAAAGTACGA GTGATCACGT CGCGTTGTTG TTCTGGCGTT AAAGAGTTGA AACGTACTGC 17 52 0 

GTAACCAGAA ACACGGATAG TTAATTGTGG ATATTTATCT GGATTGTTTA CCGCATCTTC 17580 

TAACGTTTCA CGACGTAATA CGTTTACGTT TAAGTGTTGG CCACCTTCAA CTTTAACAGT 17640 

TGGTGCTACG TTTACTGGTA ATTCACGGTA TTCGATTTGA CCTAATTCGC TAACTGCAAC 17700 

AACTTGGTCT TCAGCGAATT CGCCTTTTGC ACATAAGCAA CGTGCTTCAT TTTTTTCGCT 17 76 0 

GTCTAATAAC CAGAATGAAT TAAGCAAATT GTCATTTGCT GCTTGAGTAA TTTGAATACC 178 2 0 

TTTAATCATA ATAGCCTCCT AATTGGCACG AAAATAATAA TAAAACTGTA CAAATTTTAT 17 880 
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ATGCGACAAG ACGAAGCAAC TCAGCAAAGC TGCAAATTTG ACCATCATGC GTTGGCCCAC 19620 

GATAAGCATT TACATTTTGA C C AAG TAG AT TGACCTCACG CACACCTTGT TCCGCAAGTT 196 BO 

GCGCAATTTC AAAT AG C AC A TCATCTACAG GACGGCTAAC TTCTTCTCCA CGAGTATAAG 19740 

G CACAACAC A AAAAGTACAG T ATT T ATT AC AGCCTTCCAT AATGGAAACA AATGCCGTTG 19800 

GGCCTTCTGC GCGAGGTTCT GGTAAGCGGT CAAATTTCTC AATTTCAGGG AAACTTACGT 19860 

CTACGACGGA ACTTTTTCCA CCACGAATTT GATTAATCAT TTCAGGCAAG CGATGCAAAG 19920 

TTTGCGGGCC AAAAATAATA TCCACATAAG GCGCACGATG GCGAATATGT TCCCCTTCTT 19 980 

GAGAGGCTAC ACAGCCGCCC AC AC C AATC A CTAAATTTGG ATTATTTTTC TTTAATTCTT 2 0040 

TCCAACGCCC AAGTTGGTGG AACACTTTTT CTTGTGCTTT TTCACGAATA GAACAGGTAT 2 010 0 

TTAATAATAA TACGTCTGCT TCTTCAGGTG CTTCCGTGAG TTCTAATCCG TGGGTGCTTA 2 016 0 

ATAAAAGATC AGCCATTTTA GATGAATCAT ATTCATTCAT CTGGCAGCCC CAAGTTTTAA 2 022 0 

TATGTAATTT TTGAGTCATT TTCTACAATA TCGTTAGGTG TTATAAATAA TTAGTTAAAT 2 0280 

AAAGCAATGG GCTATTCTAC AGCTTGCGCT AATTCTGGCT AGTGAATATT CGGTGAAAGA 2 034 0 

CACTAAACAG AATATTTTGA TAAGCTAGAA AGGCTTGTAA ATCATTAAAA AAATAAGATA 2 0400 

TTCATAAATA TACTCAAGAT ATAATGCAGT ATGAGGTAAA TATTAGTTCA TAAAATTAAA 2 0460 

GGAAATACTA GCTTATCGCG ATTACAACGA TGTGTATATT CAAGGTAAAC CAAAAGGATT 2 0 520 

AAGCCCTGTA GATTACAGGA CTTAATCCTT AAATTAAATA GAGCTCACTT TTTAGTGAAA 2 0 580 

GCGCAAGCAT ATATTTATAA TCAACTACTT TGTCTACTAC ACCGTCAAAA TAG CAGTTAA 2 064 0 

ATTTAAAAAA GATTAGCTTA TAACCAGCCT AAACT CATTA ATAATTTCCA CCATGCCATA 2 0700 

CCGATAGTTA GATGGATAAT TAAACTACCA AAAGCAATGA TCGCGCCAGT AATCCACCAA 2 0760 

G ATT TT AT AT C ATT AT AAC C TGCACCGAAT GCAATTGGAC CTGGTCCGCC TCCGTAGTGA 2 0 820 

GTTACTGAAC CACCATAAGA ATTAGCAAAG ACTAAGCCTA GAG C AAG C AA TTCAGTAGGA 2 0880 

GCGCCTGTTA CATGTCCGAC TGTTG C AAAT ACGGGTACCA TTGCAGCTAC ATATG CACCG 2 0 940 

CCTGATGCGA ATAAATAACG TACAGAGACA CTTAAGGTAA GAATCACTAT TAACGCCATC 21000 

AT AC CATGTC CCTCAAACTG AAGTATGGTA CTTAAGGTGT TTGCTAACCA TTTGAAAAAG 2106 0 

CCAGATTTTT CAAGCAAGCC AGACATACCG ATGATACCAC CAT AC CATAC TAGGGTATTC 2112 0 

CATGCGCCTT TGCTTTTTAG GATATCGTCC CAGGTGACGA TACTTAATAC AATACAAAGC 21180 

ACCATTACAA TGATTGCAAC AATGGTAGCA TTGATATGAA GTGAATTAGA GAAAATCCAG 2124 0 
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CCGAATAATG CAATTACGAA TAACACGCTG AGTGCTTTTT CACGAAAAGA CATTGGACCT 213 00 

AATTCTTCTA AGCCTTTTTT CGCAATGGCT TTATTATCCA CTTTTTTCAA TTCTGGTGGA 21360 

GATACCCAAT AACAAATTAA CGGAATTAAG AATAAGCAGA GTAAGCCGGG TACGGATGCA 2142 0 

GCTAAGAACC ATTTGATCCA AGTGGTTTCA AAACCAAGAA TTGGTGCCAT TAATGAAAGA 214 8 0 

GCTAAGGCAT TAGGAGCCAT TGCGGTTAAG AAAATATAAG AGGTAGTTTT AACCACCATA 2154 0 

TATACGTTCA TCATCAAATA ACGCCCTGCT TTTTTCGGGC TTTTTTCTGG ATCTGAACCA 21600 

AGAGCAACCA CTACG CTATT AATGATTGGA AAGATAATCC CACCTGAACG CGCGGTGTTA 21660 

GAGGGCGTTG CTGGCGAGAG TAAAAGATCT AAGAACATGG TAACATAGCC TAAG CGTAGG 2172 0 

GTTGTGCTAC CCATTGCGCC GATCATATGA TAGGCGATAC GTTTACCTAG ACCTGTGATT 21780 

ACAAATGCTG AACTCAAAGT GAATGCGGTA AAAATAAGCC ATGTTGTGCC AGATTTATAA 21840 

CCATCCAACA TATTGCCGAC TTTAACCAAT TCTTTCGCCC CTTCAGTATT ACCGATGATA 21900 

ACG C C AG AAA CCGCAATTGC AGC CAATAAG ATAACGGGCT CGCCATAAGG TTTAAGCACA 21960 

AGCCCAACGA TAGTGGCGAT ATAAACACCA AGTAATCGCC AAGCAATCAA AGAAAG C CCA 2 2 020 

TCAGGAGCAG GTAGTAAGAA AGTGATCAAT GGGATAGCCA TTAAAATCAC CAGTTTTGTA 22 08 0 

TTTTTGGATA ATGC CAT ATT AATTACCTCT AAATGATAAA AAGAAAATTG TTAGTGCTAA 2 214 0 

TAAATC AG C A CTACCGCCCG AACTTAAATT ACGTTCAATG CAGGCGGTAT CAAATTTCAT 2 2200 

TAATGCTTGT GTCAGTGCTG TTTTATCTGT CACAAGGTGT TGATCTGTAA GCAGATCCTG 2 226 0 

TGCTGTTTGC TGTATGAAAT ACAATCCCGC TAAACCACCT CGATGAACCA CGTTAGTGTC 2 2 320 

GGAGTTAATT GCCATTAAAT TCAACAACAA AATCAGTAAA CGATGTTCCC ATTCTAACTG 22380 

ATGGTACTCA TCAAATTGTG GCAACAACGT TTGAATGAGA TT AAAAC CAT TTTCTGCTTC 2 244 0 

GCCACGGACA CCTGTTAAGC CATATTTTTG GAATAAACGC ACTCCAGCCG TACTAGGCAA 2 2 500 

GTGTTCAGGA TAATTTTTCA GTTCATCCGT TAAACCTTGA GTAAATTGTG CAACAAGTGA 2 2 560 

GCAGATTAAT TT AAC AT C AA AATCTACCGC ACTTTGAACC AAACTTTTTT GTGCCAATAA 2 262 0 

TCGTCCAATT GCCGTACACA CTAAAC C AAA AGAAAAAATA GCTCCTTTGT GGGTGTTTAC 2268 0 

GCCATCAGTA ACTTTAAACA TTGCTTTTTC TGCTAATAAA CCTAAAGGAC G AATTT C AG A 2 274 0 

GAGAATTTGA TTTTCAGACA AATGCGCGGT CATCATGCCT TTGAGAACAA ACTGCGTAAA 22800 

AAAAGGTTTT AAACTAATCG CACTTTGCTC AAAAGTATGT AGGTTCATAT CTTTGTGCGA 2 2 860 

ACCGTTATTG ATGGCATCCA CTAAAC CAGG TTTTGGCGAT AAGCGAGCCT CTTGTATTAA 2 2920 
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TG C C AG AT AA ACTTGTTCAC CAATATGTTC TGCGAAATCA TGATTCTGTG CGAGAGATTG 22 980 

AATTTTATCT ACAATCTCAT CAATCTCGTG CTTGTGGGTT CTCGCACAGA TTTTTGCATT 2 3040 

TTCTCCGCAC ACTAAACAAG TTCTTGGAGA AAGATCAAAA TCTGTACGAC TCAGTAGATT 2 3100 

GCCTTTAGCA TTAAACACAT CTAAATCCCA CAAACGAGCA AGAGGAATAC TTTCTTCCAA 2 3160 

TTCAATCATC AGAACTTTTA GCGTACGAGC ATCTATCGGT AATACAAAAT AGGCTTCATG 2 3 220 

CCCCGTTTCT AGCGGACGGA TTATTTCTTT TACTGCTGTG ATGTTTAACT GAGTAAAAAG 232 80 

TGCGGTCAGA TTTTCTAGAG CTTTTGTAAA AACATAATCC AACAAAGCAT TTTTTTTTAC 2 3 340 

TCCACCCACA GCAGTTAAAG TGATGCAAAG TAAGGTTTGT CCATATTGTG TTATAAGCTG 2 3400 

TTGTTGTAAA ATTGCACGTT CCTCTCGAGC ATTAAGCAAT GCTTCAAGTG AAATTTTTGA 2 3460 

ACCCTCTGTA GAGAATGTCG TAAAAAAATG CTGCAAAATT TGACCGCTCT TTTGTATAAA 2 3 520 

AATAAATTTT CTTAACTCTT TCCATTTATA TGGTGCTGGC GAAAGCTGAA AGAGCTAATT 2 3 580 

GAGTCCCTTT CAACTCTTTG GGCTCCTTTC CTGATAATCA TGGGAAGACA AGAGTTAATT 2 3 640 

TTGGCTATTG AAAGTAGATC GCTTGTAATG CTGATTCTAG TCTTTAACTT GAT AC AC CGT 2 3 700 

ATCAATCACC G AAC CAT C AC GATAACGAAC AAC AG C C AC A GGTTTATTAG TAAACTCAAT 23 760 

TTCTTTTGGT TTACCAGTAA TGCTGTAAGC ACGTTCACAA AGTTGTTCAA TAGTAAACAA 2 3 820 

TGGAATGTCT GTTTTGCTTA AAGCTTCAAT CAAATCTGGA CGTTTAGGAT TTACTGCTAC 2 3 880 

ACCATGATCG GTAACGAGAA TATCGACGTT TTCACCAGGT GTTACACAGG TAATCACGTT 2 3 940 

TTCCACCACG GTTGGAATAC GACCACGAAC TAATGGTGCC ACAATGATTG CTACTTGTGC 2 4 000 

AGAAGCTGCG GTATCACAAT GACCACCAGA TGCACCGCGA ATCACACCGT CAGAGCCAGT 2 4 060 

CAATACATTC ACGTTAAATT TAGTGTCGAT TTCTAAAGCA GAAAGAATCA CCATATCTAA 2 412 0 

ACGTTCAACA GATGCGCCTT TGGAACTGTA ATTCG CAT AT TGGTTAGCGG AGACTTCAAT 2418 0 

ATGATTTGGG TTACGGGCAA GTGATTCAGC TGCAACAGAA TCAAAGCTTT GT AC AT CT AA 2424 0 

TAATTTTTTA AT AAG AC C AG CCTCATGTAA TGCCACCATA CTTGCAGTAA TTCCGCCAAG 24300 

TGCGAAATCT GCAGTGATAT TTTCACGACG CATTTTTTCT TCTAGAAAGC GTGTAACAGC 24 36 0 

TAATGCTGCC CCCCCAGAAC CAGTTTGTAA TGAGAAGCCA TCTTTAAAAT AACCACTGGC 2 4420 

AAAAATTACT TCTGCACATT TACGCGCAAT GAGTAATTCG CGTGGATTGG TAG TC ATTCG 2 4480 

GGTTGCACCA CCACCAATTT TTTTCGGGTC ACCGACTGCT TCAACTTGCA CGATTAAATC 24 54 0 

TACTTGATCT TGAGCTATGC TGATTGGGTG GTGCGGGTAT TCAACAAATT CCTCTGTGAG 246 00 
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CAACACAACT TTGTCAGCAT ATTCAGCATC GACACGCGCA TAGC CTAGAG AGCCGCATTT 24 660 

ACTTTTTCCA GTAAATCCAT TGGCATTACC GAATGTGTCA CAGCAAGGCA CGCCTAAAAA 24720 

AGCAACGTCA ATTTTTAATT CGCCAGATTT TACTAAGTGA ACACGTCCAC CATGAGAATG 24 780 

AATATTTACA GGTTCATTCA ATAAACCGCG AG AG ATTTG C TCTGCAAGTT CACCGCGTAA 24840 

ACCAGAAGAA TAGATTTTGG TGACCACGCC ATTTTTGATA TGTTCGACAA TCGGGAAGTG 24 900 

GCTATCAATT AAAGATGAAG ATGCAAGGGT CAGATTTTTG AAACCCATTT CGGCAATTTT 24 960 

ATTCATCACC ATATTCACCA CAAAATCTCC GCCCCGAAAA GCATGATGAA ATGAAACAGT 2 5020 

CATACCATCT TTTAAGCCAG AGCGTTTAAT CGCTTCTTCT AGAGAGGTAC AAAGTTTGCG 2 5080 

ATCTTTCGCT GTACGAGAAA GGGATTCAGA TTTAGGCACC GCTTGATAGA CTGAGCGATC 2 514 0 

AGCGTGATAT TTTTCAATAC GTTGTTCTCT TGTTGTCATC TTTTTATCTC CCTTTACTCT 2 5200 

TCACGAATAC CTGATTTCGC ACGTT CTAAT ACAAGTTTTG CGCGATCAAT AATCGGTGCG 2 5260 

TCAATCATTT TAC CGTTTAA CGAAACAACA CCAGCCCCTT GTCGTTCAGC TTCTACTGCC 2 5320 

GCTTCAATAA TGCGTTTTGC CTGCTCCACG TCTTTTTGTG TCGGTGCAAA TAAGTTGTGT 25380 

AACAATTCAA TTTGGCGTGG GTTGATTAAG GATTTACCGT CAAAACCAAG CTGTTTAATT 2 5440 

AACGCAGCCT CTTTTAAGAA GCCTTCTTCA TTGTTCGCGT TAGAATATAC GGTATCAAAA 2 5 500 

GCCTGAATGC AGCCGCACGT GCCGCTTGTA AAATAGAACA ACGAGCAAAA AGCAGTTCAA 2 5560 

TCCCCTCTGG TGAACGCTCG GTTTTTAAGT TACGTACGTA ATCCTCTGCA CCCAATGCGA 25620 

TACCAATTAA ACGTTTAGAA GCGAAAGCGA TTTGGTTGGC TTGAGTGATA CCTAATGGCG 25680 

ATTCGATTGC GGCTAGCATT TTAGTCGAAC CTGCTTCTCG ACCGCAAGCT TTCTCAATCT 2 5740 

CAGTAATGGC GTGATCCATA TCCAACACAT CTTGCGCGCT TTCAGTTTTT GGCATACGTA 2 5800 

CCACATCCAC ACCCGCACGC ACTACCGAAT TCAAATCCAA TAAACCAAAT TCAGAATCTA 25860 

GCGGATTAAC ACGAACTACG GTCTCAATTT CTTTATAAAG CGGATGTTGA AGTGCATGAG 25920 

CGACTAATAA ACGAGCAGAA TCTTTTTCTT TGAGAGCAAC CGCATCTTCT AAGTCAAACA 2 5 980 

TAATGGAATC TGGTTTATAA ATAAAGCTAT TACTTAACAT GGCGGCATTT GAGCCCGGCA 26040 

CAAATAACAT ACTTCTTCTT AATTT CAT AG CACAGCCTCC CAGTTAATCG CTTCATCAGT 26100 

TGCACGCATT GCAGCAGCTT TTACTCGAGC TTGTAACACG CAATCTAACG CCCCTTTGTC 2616 0 

TTCTACAATG ACTTGTGCTG CTGTAATGCC TAGTTTTGCT AATACTTCAC GAACTGTCGC 2622 0 

TTCAATTTGT TCACCAAATT GTTTTGCTAC TGAGCTATTG ATTTCAATAT CGAGTGAGTC 262 80 
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AAAAGGTTGA ACCCGTACTT GAACATCACT GGATTCCAGT GTCCCTGCAA CGGCTACTTT 2 6 340 

TGTTATTTTC ATAGATTTTC CTTCCGTATA TTGATGAATA AAAATGTATT TAAAAACGAC 2 6400 

CGCACTTTTG TAAATAATTT AACGTAGTCA TAGG CACAAA CTCTGCGAGT TGAGCCCAAT 26460 

TTTTTTCAGC CAAATGCTTT CGTACTGTAG AAGCAGAAAT AATGTGGTTG CTTGCTGTTT 2 6 520 

TTCTAGGTAT CTCGATAACA TTTATTTTTG GTGCATTCAT TTCGGCATCC ATTAACCAAT 26580 

AATG C ATTTG GCGATTATAT TCGGCAGTGA CGGGACAATT TAGCTCAGTA CCAACAAAGC 26640 

GATGTGTTAT TCCCAAAGCT TGAGCAATAT GTAAACGGAA TAACTTCAGA TCAATCTCAA 2 6 700 

AATAACTTTC ATCTGTAATT AATTGATCTT TTAGGAAATA ATTTGGAAAT GTTGCACGAG 2 6 760 

AAATAATATA GTCAGAGCCA GAATGAAGAG TGATATTGGA TAAATCGAAA ATACCTTGTT 2 682 0 

GAATCATTTC AAATCGTTCA GTGTAAGAAA ATTG CGATGC GTCTTCACCG ACAATAAATA 26 88 0 

AATGCAGATG ATCACATTGT TGTAATGCTT GCTCAATTAA ATAACGATGA CCTAAGGTAA 2 6 940 

ATGGATTCGC GTTCATTACA ATCGAGCCAA TTCGATTACC ATCCACACGC ATTTTCTCCC 2 7000 

ATAATGAGCA TTGTTTTTGT AAACGCGTAG CACTATTTTC TAATAATACT ACATAAGGAT 2 7 060 

TGGCATCAGA GATGATATAA AAACCACAAG ATTTGAACAA CGTTG CATAT TCGGGTTTTG 2 712 0 

TATAAATAAA AAGGTGTGGA CGTTTTAGCG TGTAAGCTAA ATCAACCAAT TCAGTAATTA 2 718 0 

ATTG CAAGG C AACACCGCTA CCACGCAAAG ACTCATCAAT CGCAACACAT TTTATGATAT 2 7240 

TGCCTGCTAA ACCGCCACAG ACAACGATTT GTTCGTTATC ATTATAGCCA ACAACAAAAT 2 73 00 

ACTCGATTTG TTCATCTAAT TTAAGCGCAT TTTGATGTAA AAATGTCTGA ATGAGAGAAA 2 736 0 

GTTTTTTTTG TTCAGTAGAA ATTCGCTCAA ATTGCATCGA CTCTCCAAAA AAGAAGAAAA 2 7420 

CATAACATCA CTACCAAGTT AGTAGTATAG AGCTATTATA AGCGATGAGC AAATTACGAT 2 7480 

ACAAGATAAG CTCATATTTT TCCCAACTTT ATAATCTAAT ATT AT CTTC A CACTGCATTA 2 7 540 

TACATTGAAT AATTTATAGG CAATATTGGC GGTTATTCTG AGTTGTAAAA ATAAaCCGCA 2 76 00 

CTTTTGTTGC AAGCAAAAAG TGCGGTTTAA TTTAGCGGAA TTTTAAATTA TTTCACTAAT 2 7660 

CCGCCACTGG CTTGTAAATC AGCGTGATAA GAAGAACGAA CAAATGGCCC ACAAGCCGCA 2 772 0 

TGTTCAAAGC C C ATTT CAT T GGCTTTATCA CGGAATTCAT CAAATTCAGT TGGCGGCACA 2 7780 

TAACGGGCAA CAGGCAAATG ATGGCGGCTA GGTTGAAGAT ATTG AC C C AA CGTGAGCATA 2 7 840 

GTCACACCGT TATCACGTAA ATCTTGCATC ACTTGCAAAA TTTCCTCGTT GGTTTCGCCT 2 7 900 

AACCCAACCA TTAAACCAGA TTTGGTTGGG ATATTTGGGA ATATTTCTTT GAATTCACGC 2 7 960 
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AATAATTTAA GTGACCATTC GTAATCCGCA CCTGGACGAA TTTCTTTATA TAAGCGCGGT 28020 

ACGTTTTCTA AATTATGGTT AAAGACATCT GGCGGATTAT CTTTTAATTT CTCTAAAGCC 2 8080 

TGTGTAATGC G AC CACG AAA ATCAGG C ACT AAAATTTCAA TCTTAATGTT AGGATTAAGC 2814 0 

TCGCGCACAG CTTTTACACA TTCAGAAAAA TGACCAGCAC CACGATCAGG CAAGTCATCA 2 82 00 

CGATCCACCG ATGTGATTAC CACATATTTA AGTTTCATAT CTTGAATCGT TTCTGCCAAT 2 826 0 

TTTTGTGG CT CTTCAGGATC TGGCGGTAAT GGTTTACCGT GAGCCACATC ACAGAATGGG 2 8320 

CAACGGCGCG T AC AAATTG C AC CTAAAAT C ATAAAAGTGG CTGTGCCATG ATTAAAACAT 2 83 80 

TCGTGTAAAT TTGGACAAGA TGCTTCCTCA CAAACAGAAT GTAATCCGTG ACGGCGCATT 2 8440 

C CATTTTT AA TGCTTTCAAT TTTAGCTGAA CTTGCTGGAA GTTTAATTTT CATCCATTCT 2 8 500 

GGTTTTTTTA ATAAATCTTG ATTAGGATCG ATATTTTTtA CGGGAATAAT TGAGGTTTTT 2 8 560 

GCCGCATCGC GATATTTTAC ACCGCGTTCC ATTTTGAAAG GCGTAGACAT AGTATTTTCC 2 8620 

TAAATAAAAA TGCCCAAATT GGGCATTTAG TTTTTATAAA ATGTTATAAA GTTGTTATAT 2 8680 

TATAGC CTAA AAGATTGGCG AAGTGTTTAA TTAATTTTGC TGAAACATTA TCGCAATCAG 2 3 740 

CTTTATCTTG ATTAACAAAA TCCGCTAACT GACACATTTC AAGCCCTGCA TAACCACAAG 2 88 00 

GATTAATATA ATGGAAAGGA TTAAGATCCA TATTAATATT CAATGCCAAC CCGTGAAAAG 2 8860 

AGCAACCACG ACGAATACGT AATCCAAGAG aACAAATCTT TTTTCCATCC ACATAAACAC 2 8 920 

CTGGCGCATC AGGCTTTG G A TAGCTTTCAA TGCCATATTC AGCCAAAGTT TTAACCACCG 28 980 

TTTGTT CAAG TGCGGTAACT AATTGACGTA CATTTAAATT TTTATGGCGT TTAATGTCAA 2 9040 

TAAGCACATA CATCACTTGC TGACCGGGTG CGTGATAGGT AATTTGCCCA CCACGAACCG 2 9100 

ATTGCACCAC AGGGATTTCA CTACGTTGTA AAAGATGCTC TGGTTTACCT GCTTGCCCTT 2 916 0 

GTGTAAAAAC AGGATAATGT TGTACTAACC AAATTTCGTC TTGCGTTTCA GCATTTCGAG 2 9220 

TATCAG t AAA ATCCTGCATT TTGTGCCAAA TTTCTTGATA ATCTTGCAAT C CTAACTG AC 2 9280 

GAACGATAAG GGAATTGnTC ATACTAAATA ACCATTTTCA CGCCTTCAAC TTTAGCCAGT 2 9340 

TCTTTATAGA GTGTTTCTAC CTGATCAAAG TTTTCAGCAA TAATATCAAT TGAAACTGAA 2 9400 

TT AT AAGTG C CTTTACTACT GCGCTTTTCT TTTGGAATAT AATCGCCTTT AATATATTTT 2 9460 

TGCACCACTT GTATAAGATC TTGTGCTAAC CCTTCTCGAT TAATTCCCGC AACTTTAAAA 2 952 0 

GTCATTTTTG CTGGAAATTC CATTAATTCC TTGAGTTTTG CATAATCGTT TTCTATTGTC 2 9 580 

ATTTTCTTTC CTAAAAATAA AATAAATCAA AAAGTGCGGT CAATTTTCAA ATTCTTTTTA 2 9640 
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TGAGAATAAC CCTTTCACAG TCAATACTAA CCAATCCCAT AGTTTACCAA AAATACCCGC 2 9700 

TTCTCCCACA TCATTCATCA CTTGTAAATT GGCACTTGCA ATATCCTTAC CATCCAGTTG 2 9760 

ATAAACCACT TTACCAATCA CTTGACCTTT TACTAATGGT GCTTGTAAAT TTTTATCTGC 2 9820 

TAATTCATAA CGAG CTTTAA CCTCACTTTG TTTAC CTTT A GGGATCGTAA TAAAGTGATC 2 9880 

C ATTAAAG C A C CAAGTTTTA CTGAATTTTT ATCGCC ATAA TAAACCCTTT GTTCTGAAAT 2 994 0 

TTCTTTACCA GCCTCTAAAG TTTTAAATGT TTCGAAATTA GCAAAACCCC ATTGTAAAAG 3 0000 

TTTCTTACTT TCTACTTCAC GACCTTTGTA AGTTGGCACA CCCATTACTA CAGAAATTAA 3 0060 

ACGCATATTG TTTG ATG TAG TCGCTGAAGC GACTAAGTTA TATCCTGCTT GGCTAGTATG 3 012 0 

CCCTGTCTTC ATT C CAT C AA CATTGATAGT TTTATCCCAT AATAGTCCAT TACGGTTAGC 3 0180 

TTGTTTGATT TTATTAAATG TAAAGTTTTT TTCGGAGTAA ATTTTATATT CTtCAGGTAA 3 024 0 

ATCACGAATA ATATGCGCAC CAATAATTGC CATATCACGC GCGGAAGAAT ATTGATTTGG 3 03 00 

ATCGTCTAAA CCATGGGGCG TGGTGAAATT TGTATTTTTC AAAC CAAATT GTTGAACATA 3 0360 

TTTATTCATT GTTTCTACAA AGTTCGGTAC ATTGCCAGAA ATATGTTCAG CTAATGCAAC 3 0420 

AGTCGCATCA TTACCAGAAA CAACAATCAC ACCTCGATTT AAGTCTGCTA CAGATACTTG 3 0480 

TGTATTTAAA TCTAAAAACA TTTTTGATGA ATCAGGGAAA TTACGCCCCC ACGCACTTTC 3 0 540 

ACCAATGGTA ACCATATCGG TATTATGGAT TTTGCCTTGT TTTAATGCGA CACCAACTAC 3 0600 

ATAACTTGTC ATCATTTTAG TCAGTGATGC TGGATATTGA CGTTGATCTG GATTAAGTGC 3 0660 

GGTCAAAATT GCACCTGAAT TATAATCCAT CAATACATAC GTTTGAGCCG TAATTTGTGG 3 072 0 

TGGAGTTACG CCAAACTGCA TATCCTCTGC AGAT AC CGAA AAAGCACTAA GTGCCACAAA 3 0 780 

ACTTGATAAA AATG CGATTT TTGTTGTTCT TTTCAACATA GTGTGTTCCT ACAATTAATT 3 0840 

TTTATACGTA TAAACAATTA ATGGCTTATT ATTTGCCATT TTTTGTAATT TAGTTCTCAC 3 090 0 

TTGTG CCATT TTTGTTTTGT CATCAAAAGG CCCAATATAA ATTTCATATT TATTTCCTGA 3 0 960 

G CGATTT ACT TCAGTCTGAA TATTCGCTAA CGCAAGTTGC GTAATTAATT TATTTGCTTG 310 2 0 

GCTACGAGAG GTTAATTCAA G CATTTTC AA ACAGTAAAAT TCGGTTCCTT TT AG AG C ATT 310 8 0 

TATTGACTTA GATGTGTTAT CAAAAAGTGT GTTAGATTTT AAT ACT AAA C GATCGGCAGC 3114 0 

TTCCTGAGTT TTTGCTTGTT TAGCCAGTGT TTTTGTGGCT GCGCCTGATA AATTC CCATT 312 00 

TTTAGCAACG TGCAACGCCT CAATTCTTAC CTGCCCTATG CCGCGGGAAA TTAAGCCAAT 3126 0 

TTCTTTTGCC GCAGCGTGTG AAAGATCAAT TAAACGTTTA TCACTAAAAG GCCCACGATC 31320 
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ATTAATACGT ACGATAACTT TACGATTGTT ATGTAAATTT GTCACTAAAG CATAAGAATT 313 80 

TATTGGCAAT GTTTTATGTG CTGCAGTAAA TTGTTTTGAG TTATAAACAT CGCCACTTGC 3144 0 

TGTTTTGCGC CCATCAAATT TAAGATGATA ATAACTTGCT AAACCTTCTT TAATATAAGA 31500 

TTTGGCTTCA TTCCCACTTT TAGTGGTATA AGTTTGCCCT TTTACTGAAT AAGTCCTTGG 31560 

TGCAGGCATT TGAGTTGCGA TAGAAAGATT ATCCCCTCGA ATACCATACA TTTTTTGCGT 31620 

ATCAGCCTGT GCAGGAAAAG CTACGCTTAT CATAAATAGC AAAAGTGCGG TCAAATTTAA 31680 

CCCAGTTTTT AATTTCATAA AATATTAACG CTTTTCATTT ATTCTTTCTT TAAGAAAGAA 31740 

AAAATGAAAT TAGCTCCTTT TCATAAATTG ACTTTTATGC GTATGAATAG ACATCAGTAA 31800 

AC CAAAACTT GCCATAATCG CAACATAAGA GGTACCACCA TAACTAAATA AAGGTAATGG 318 6 0 

CACACCCACA ACAGGCAAAA TTCCGCTTAC CATTCCAATA TTCACAAATA CATAAACAAA 31920 

GAAAATTAAT GTTGTTGCAC CAGCAAGAAT ACGCCCAAAA GAAGTTTGAG CATTTACGGC 31980 

AATCATTAAA CCTCGCACAA TAATAAACAA GTAAATCGCC ATT AAAAT C A AAAAACCAAT 3 2 040 

CATTCCGTGC TCTTCACCCA TTACCGCAAA AATAAAATCT GTATGAGGTT CGGGCAAAAA 3 2100 

TTCTAATTGT GATTGAGTCC CCTGCATCCA GCCTTTTCCG CTTAATCCAC CAGAACCAAT 32160 

CGCAATCTTT GATTGTAAAA TATGATAGCC AGCACCGAGA GGaTCTTTTT CTGGGTCCAA 32220 

TAATGTGAGA ACGCGAGTGC GTTGATAATC GTG C ATT AG A TATAACCACA TGATCGGAAT 3 228 0 

AAATCCAGCT AAGCCAATCA CTGCCGCTAA AATAAGCCAC CAGCTCATTC CTGCTAAAAA 32 34 0 

CACGACAAAT AAACCTGACG CACTGACCAA AATTGATGTG CCTAAATCAG GCTGAATCGC 3 2400 

CACAAGCAAT GTTGGCAGTA GAATCATTGC AATCG CTAT A AATGTTTCAC TTAATTTAGG 3 2460 

TGGAAGCGGG C G ATTACCT A AATACACCGC CACCATTAAT GGCACAGCAA GTTTTACAAT 3 2 520 

TTCAGAGGGT TGAAAACGAA TAAATCCAAG ATCAAGCCAA CGCTGTGCAC CTTTACTTGT 3 2 580 

CGTACCAATA ACATCTACCA AGATCAATAA CACAAAACCA ATTAAATAAA GATAAGGTGC 3264 0 

GATGCGTTGG TAAAATCTTG GTGGAAATTG AG CCATTAAC AGCATAACAA TAAATCCAAG 3 2 700 

TAAGACCTGA ATAATTCGGC TATTAAACAT CGTTTCGCTG GCACCTGATG CACTATATAA 3 2 760 

CACCAGCATA CCATAAGCAG TTATTG CAAG AAGTCCGATA AAC AG C C AAA AATCAATATG 3 2 820 

CAAACGTTGC CAAACACGCA ACCAAAAAGG GATTTTATCT TCCATTTATT TGTTCGCTCT 3 288 0 

CTTGTGACAA ATCTTTTTTT TGCTGAATAG GTACATTATA TTTTTCAACT TGTGGTAAAC 32 940 

GCTGATTTAA ATAGTAATCC ATCACTTTAC GTGCTAAGGG TGCGGCATTA CTTGAACCAC 3 3 000 
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CACCTGCATT TTCTAAAATC ACTGTTACAA CTAATTTAGG ATT AT CAT AA GGGGCG TAAG 3 3 060 

CGGTAAACCA AGCGTGATCA TGCAATTCTT TTTTTAAGCC TGCCGTATTA TATTTTTCAT 3 3120 

TTTCCTTTAA ACTAAACACT TGTGCAGTGC CAGATTTCCC TGCAACACGA T AATTTG CAT 3 3180 

CTGCAAAGGC TTTTCGCCCA GTTC CATTTG CAGCATTAAC AACATTGTAC ATACCGCGTT 3 3 240 

TCGCTGCCTC CCAAGCTGCC ACTTTAGGTG TATTAATATC AGGATAAAGT AACGGATCTT 33 300 

CATAAGG CTC TAATACTGCA CCTTCAATTG CTTTCATCAA ATG CGGTGT A TTTACTTTGC 3 3 360 

CATTATTGAC CAATATTGTC GTCGCTTTTG CTACTTGTAG TGGTGTTGCC GTCCAATAAC 3 3 420 

CTTGACCAAT CCCAACCGAA ATCGTATCGC CTTGCACCCA AGGGCGTTTA TAACGTTTTT 33480 

GTTTCCATTC TCGAGTTGGT ATATTGGCAG CCGTTTCTTC TTGAATTTCA ATCCCCGTTG 3 3 540 

GCATACCAAA ACCAAAATCC TTCATCCAAT TAGAAAGGCG ATCAATACCC ATATTATAAG 3 36 00 

C C ACTTG AT A AAAATAAGTA T C AG AGG ATT CAGTAATCGC TTTATTCAAA TCCGTATCCC 3 3 660 

CGTGGCCTGT TTTTTTCCAA TCACGAAAAC GTTTTGTGGA ATTAGGTAAT ACCCAATATC 3 3 720 

CCGGATCAAA AATCGTTGTG TTTGGTGTAA TCACATTTTC AGTTTGTGCG GCTACTGCAA 33 780 

TAAAAGGTTT GACTGTCGAA GCGGGCGGAT ACGCACCTTG TGTCGCACGA CTATAAAGTG 3 3840 

GACGAGCAAG ATCATTTAAT AAACGTTTAT AATCTTCGGA GGAAATACCA TCAACAAATA 3 3 900 

AATTATTATC ATAACTTGGT G TAG AT AC C A TCGCTAATAC ACTACTATCT TTCGGATCAA 3 3 960 

GCACAACTAC AGCACCTTTT AGTCCCGATA AAAGTTCAGT AATATAACGC TGTAATGCCA 34 020 

AATCAATGGT TAAATGAATA CTTTTTCCCG C C AC AG C AG G TCGGCTACGC AAAGTG CGGA 3 4 080 

TTACTTTGCC GCGATTATTA ATTTCTACTT CTTCAAAACC CGTGGTACCT TGCAGTATAT 3 414 0 

CCTCGTAATA ACGTTCAATA CCCAATTTTC CAATATCGTT CGTTCCCGCA TAATTCGCGA 342 00 

ATTTATCCTC TCGTTTCAAA CGTTCTACAT CTTT AT CAT T CATTTTACCT ACATAGCCCA 3 4260 

AAATATGCGC CATTGTTTCG C C AT ATAAAT AATGGCGTTT GAAATAAGGA CGTACCTCTA 34 320 

AGCTCGGGTA TTGGTAGC C A TTTACCGCAA AGCGTGAAAT TTGTTCTTCT GTTAAATTCG 3 438 0 

GCTTTAATAA AATCGGTGTA TAACGCGTTC CGCGGCGACG TTCCTTTTTA AAATTTTCAA 34440 

TATCGTTATC CGTCAATCCC ACAATATAGC GTAATTCATC T AAAGTG CGG TCTAAATTTT 3 4 500 

CAGTTTTTTC TGGCACAATG TATAAACCAA AAAAAGTAAG ATTTTCGGCT AATAATTTGC 34 56 0 

CGTAACGATC ATAGATTAAG CCACGTGTTG GCGGTAAAGG TAATAATTTA ATTCGGTTAC 3462 0 

CATTAGAACG CGTTTGGTAA GTATCGAAAT TAACAATCTG TAG CTG ATAA ATATTGGTGA 34 680 
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ATAACACGCC TGTTAAGAGC AGAATCCCCA AAAAAGCCAC TAACGTTCGG CGTGCAAAAA 3 4 740 

GGTTACGCTC CGCTTTTTTA TCGCGGAACG GTTCGTGCGT TGGTGTATTG AAGAATTTTT 34 80 0 

TT AAATT CAT CTGACTAAAA TAAGGATGTA TTATTCTCGA TGATAAGGAT GATTAGTGGT 34 860 

TAGCGAC C AC GCACGATACA AACTTTCAGC CACCACGACA CGAACAAGCG GGTGAGGTAA 3 4 920 

TGT CAAGGG A GAAAGCGACC AACTTTGCTC TGCAGCAGCT TTGCATTCTG GCGAAAGCCC 34 980 

CTCAGGCCCA C CAATCAATA AACAAACATC GCGACCATCA TTTTTCCACG CTTCTAGTTG 3 5040 

CTCAGCTAGC TGCGGCGTCG TCCAAGGTTT ACCAGGAATA TCTAACGTCA CGACTTTGCC 3 510 0 

TTTTCCACAG GCCGCTAACA TCGCTTTACC TTCTTGTTCT AAAATGCGTT TAATATCAGC 3 5160 

ATTTTTTCCA CGCTTGCCTG CTGGGATTTC AATTAGTTCA AAAGG CATAT CTTTTGGGAA 3 5220 

ACGGCGTTGA TATTCCTCAA AACCCGTTGT AACCCAAGAA GG CATTTTTG TTCCGACGGC 3 52 80 

AATTAACGTG ATTTTCACAA GTCATCCTTA TCTAAAAAGT GCGGTTATTT TTTACCGCAC 3 3340 

TTTACTAAAA TTTACGCCCA AAGTTTCTCT AATTGATACA TTTCACGGGC GTCACGTTGC 3 54 00 

ATGATATG T A CGATAGCTTG ACCAAGATCC ACCACAATCC AATCTGCAGT ATTTTTACCT 3 5460 

TCTTCGCCAA AGGTTTCAAA ACCTGCTTTT TTACATTCTG TAATTAAATT ATCTGCCATT 3 5520 

GCAGAAACTT GGCGACTCGA CGTCCCCGTG CAAATAATCA TATTATCCGT AATCGAGGAT 3 5580 

TTTCCACGTA CATCAAAATG TACAATATCT GTt CCCTTTA AC CCATCTAA GGTTTCCATC 3 5640 

AAAAATTCAA CTAACGCCAT TTTGTTCTCA CTTAG CAATA AAGGTGCGGA ATTCTATCAT 3 5700 

TGAATAC CCT CTTGCGCAAA ATCCACGCAA AAAATAAGCG CGAAATTAAC CGCG CTTTTT 3 5760 

ATGTCTAAAT TGAAATTATT TTTACTATGC TGCCCATAAT AACACTGCCA ATAACTGTGG 3 5 820 

CGACATTATT CGCAAGAACA TCACCAATGG ATACACAGTT G C AT AAG AAA GTGCTGCAGC 3 588 0 

ACCGTTATCT TCTTTAATTT CATTCGCAAA TGCTAAAGCG GGAGGwTCCG TCATTGAACC 3 5 940 

TGCAAGTAAA CCGCATATTG TGAGGTAATT AAGTTTTCCA TATAAGCGTG CTATAGTCCC 3 6 000 

AACAATAATC AACGG C AC AA AAGTGATAAA AATACCGTAT CCCATCCATT CAAGTCCCGA 3 6 060 

ACCATTCACT AAAGTATCAA AGAAACTTCC AC CTG ATTTT AAACCAACAA CCGCCAAGAA 3 6120 

AAGGACAATC CCAATTTCGC GCAATGCCAA GTTAGCACTT GGTGGCATAA ACCAGTAAAG 3 618 0 

TTTACCAATC GTGCCAATTC TCGCTAAAAT CAACGCAACC ACCAATGGCC CGCCAGCAAG 36240 

CCCTAATTTT AACG CAAC AG GGAAACCTGG AATATAGAAT GGAATCGAAC CAACGAGTAC 363 00 

CCCTAAACCA ATACCGATAA ACACTGGTAA CATTTGCACT TGAAGTAATT TCTGTTGTGC 3 6 360 
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GTTACCAATA ACTGAAATGG CCTGATTTAG TACATCACTG CGTCCAACCA TATGTAGCAC 3 6420 

ATCGCCAAAT TGCAGTGTGG TATTTCCCGT TGGAACTAAC TCAATCCCTG CACGATTTAA 3 6480 

ACGTGAAATT ACAACCCCAT ATTTTTGATG AATGCCTaAC GCACGGAtCT TTTTGCCTAA 36 540 

TAnTTTTTCA nTCGTTACCA CCACACGCTC GGAACGAATT TCGCCACTAT AGGCAACGGT 36600 

TGGTGCATCA ACTTCGTGTC CGATAATTAA ACGCAnTTTA GCAAGCGAAT T AT CATC AC C 3 6660 

CACTAATTGT AAAATATCGT TGGTACGAAT CTCAGTACTT GCCTTTGGAA CAATTTCCAT 36720 

ATCATCGCGT TTTAAACGTG AACATACGAC TTCTTCATCG CTAAATCCCG GAATTTGAAT 36 780 

TAACGTCAGA CCATCTAGAT TTTGGTTCGT CACTTTTAAA GAAATATTAT GTAAACTTTC 3 6 840 

TTTATCTTGG CTGCTCTCCG CATTAAAACG AGCAGCTTCA TCGTCCACTT TGACGTTAAA 3 6 900 

AAACAAGCGG ATTAACCACA TTGCAAGCAA AATACCGCAA ATCC CGAATG GATAAGCCAT 3 6 960 

TG CATAAG AC ACCCCCATAG TTACAGTGGT TTGAGGCACC CCTAATTCTG CTAAAATTTG 3 7020 

CTGCCCTGCC CCCAACGCAG GCGTATTAGT AACAGCACCA GAATAAATAC CTAAGGCAAT 3 7 080 

ATCCAGTGGA ACATCCGCAA TTTTATGTAC AAGAACCACG GCAATTGACC CCAGCAGAAT 3 714 0 

AATCAAAATG G C AAAGG CAT TTAGCTTTAA ACCAGATTTT CGTAAAGAAG AAAAAAAGCC 3 7200 

AGGTCCAACC TGAATACCAA TAGTATAAAC AAATAAAATC AAACCAAATT CTTG CACAAA 3 7260 

ATGCAGCGTA TGCGCATCCA GTTTCAAGCC ATATTGATTA GTGAAATGGG CAACAATAAT 3 7 320 

GCCGCCAAAC AGCACGCCAC CAATGCCAAG CCCAACCCCG CGGATCTTCC AATGCCCGAT 3 7 380 

C CATAAG CCG ATTACGGCGA CGAGGGCAAG AAGACTAATC GTGATCGCAA TATCACTCAT 3 744 0 

ATTGACCCCT TAAAAATGAA ATTTAACTCC TTCGGATTAT ATCCAAGTTG TTTTTTGAAA 37500 

ACCTTAACCA AACTCAAAAA ATGATACGTT CGTCACATTT TTTCTTGCGA AATTGAGCAT 3 7 560 

AGCCTCTACA ATACCGAAAA TTTTTGACCC TTTTACAAAA AGTGCGGTTA TTTTTCACCG 3 7620 

CACTTTTAtC TCGGATGGAA TATTTATGGA TTACAGCCAA GAAGCTATTA CTTCTCTTAT 3 7680 

TTGGATTTTA CAAACGCTAG CCATCACCTC AGTCGTCTTT AGTTTTGGCA TTTTTCTTTT 3 7740 

GGTGCGTTTT ACCCAATGGG GAAAACAATT TTGGATGTTT GCAGGCGGTT ATCTTTCACC 3 7800 

TAAACGTAGC ATTAAACCTA TCTTATTCTT TCTGCTTATT GTTGCAATGA CATTACTTAG 3 7 660 

CGTACGTATC AGCCTTGTAA ATTCTGAGTG G T AT AAAAAT ATGTATACCT CATTGCAAGA 3 7 920 

ATTTAATGAA CATGTATTTT GGCAACAAAT GGGCTTATTC TG CGTCATCG CAGCAAGTTC 3 7 980 

TGTTTCAGCA GCATTAGTAA GCTATTATTT AGAGCAACGT TTCGTCATCA ACTGGATTGA 3 8 040 
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ATGGCTTAAC GAG C AG CT TG TTAATAAATG GATGGCACAT CGCGCCTATT ACAAAACACA 3 810 0 

GTATTTATCA GAAAATCTTG ATAACCCCGA TCAACGTATT CAACAAGATG TGCAATCTTA 3 816 0 

TGTAAAAACC ACGCTTTCTT TAAGCACTGG CGTCATTGAT GCGGTCACCT CGATGATCTC 3 8220 

TTACACGATT TTGTTATGGG GATTAGCCGG TCCAATGATA GTGCTTGGTG TAGAAATTCC 3 8280 

GCATATGATG GTATTTTTAG TGTTTGGTTA TGTCATTTTC ACAACCCTCA TCGCATTCTG 3 5340 

GCTCGGTCGC CCATTAATCT CATTGAATTT TATTAATGAA CGCCTAAACG CAAACTATCG 3 8400 

TTATTCTTTA ATTCG CATTA AAGAATATGC TG AAAGC ATT GCTTTTTATG CAGGCGAAAA 3 8460 

AGTCGAAAAA AATCAGCTTT ATCAGCAGTT CAACGCCGTC ATTCACAATA TGTGGGTCAT 3 8 520 

TATATTTAGA ACACTAAAAT TTTCAGGATT TAACCTAGTC GTAAGCCAAA TTTCTGTTGT 3 8580 

TTTTCCATTG TTAATTCAAG TTGGACGTTA CTTTGAAAAG CAAATAAAAC TAGGCGATCT 38640 

AATGCAAACC TTGCAAGTAT TTGGTCAGTT ACACGCAAAT CTTTCATTCT TCCGCAGTAC 3 8700 

TTATGATAAT TTTGCAAGCT ATAAAGCAAC GCTAGATCGT TTAACTGGGT TCTGTTATGC 3 876 0 

CATTGAAAAA GCAAATAATA AATCACAAAC ACAGATCCAC AATCATCCAA CAGATGTGAT 3 8 820 

TTTCAAAAAT TTAAGTATTC AAAATCCATT AGGTCATACA CTAATTAAAC ATCTAAACAT 3 8 880 

TACCTTACCG CAAGGAACAA GTTTG CTTAT TCAAGGAAAA TCTGGAGCAG G AAAAAC C AC 3 8 940 

GCTTTTACGC ACCATTGCGG GGCTTTGGAG CTATGCCGAG GGGGAAATAA ATTGCCCTAC 3 900 0 

GCACAATCAG CTTTTCTTGT CACAAAAGCC GTATGTGCCA CAAGGAAATT TAATGTCAGC 3 9060 

ATT AG CTTAT CCAAATAACG CTGACAATAT TTCACATACA CAAGCTGTAG AAATATTAAA 3 912 0 

TAAAGTACAA CTAGGACATT TGGCTGAACA ATTAGAAAAA GAACAAGACT GGACGCGTAT 3 918 0 

TCTTTCTCTT GGCGAACAAC AGCGTTTGGC GTTTGCACGA TTAATTCTAC AC AAAC C AG C 3 9240 

GGTGGCATTT TTAGATGAAG CCACAGCTAG TATGGATGAA GGTTTAGAAT TTTCGATGTA 3 9300 

TCAATTGTTA CAACAAGAAC TACCACAAAC GACAATCATC AGTGTGGGAC ACCGTTCAAC 3 9360 

CCTTAAAACC TT AC AT C AAC AACAGTTAAT TCTGCAAGAC AAAGGGCAAT GGCAAGTGCT 3 9420 

ATAAAAAACG CTAAAAAACA CTGTAATCGC AACGCTTTTT TTGATACATT TAGTGCAATT 3 9480 

TTTTGTATGT TTGACAAAGT CATTTTCTAC TTTTTGTCAA ACAAAACTAA CTTTCATTTT 3 9 540 

AAACGGTATA ATTATGTTAT TTAAAAAAAT TCGTGGGTTG TTTTCAAACG ATCTTTCTAT 3 96 00 
TGATTTAGGC ACAGCTAATA CTTTAATTTA TGTCAAAAGA CAGGGTATTG TACTTGATGA 3 9660 
ACCCTCTGTT GTTGCGATTC GTCAAGATCG CGTAGGCACA TTAAAAAGCA TTGCAGCTGT 3 972 0 
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GGGTAAAGAA GCTAAATTAA TGTTAGGTCG TACTCCCAAA AGTATTGTAG CAATTCGTCC 3 9780 

GATGAAAGAT GGCGTTATTG CAGACTTTTT TGTGACAGAA AAAATGTTGC AATACTTTAT 3 984 0 

CAAACAAGTT CAT AG CGGT A ATTTTATGCG TCCAAGCCCA CGTGTATTAG TATGTGTGCC 3 9900 

AGCTGGTGCA ACACAAGTAG AACGCCGCGC AATTAAAGAA TCAGCAATCG GTGCTGGTGC 3 9960 

ACGTGAAGTA TATTTAATTG AAGAACCAAT GGCAGCTGCA ATTGGGGCTA AATTACCCGT 4 0020 

TTCTACTGCA GTTGGTTCTA TGGTAATTGA TATCGGTGGC GGTACGACAG AAGTGGCTGT 4 008 0 

AATTTCCTTA AATGGTATCG TATATTCTTC TTCCGTGCGT ATCGGTGGTG ACCGTTTCGA 4 014 0 

CGAGGCGATT ATTTCTTATG TTCGCCGCAC TTTTGGCTCG GTTATCGGAG AACCTACCGC 4 0200 

CGAACGCATC AAACAAGAAA TTGGTTCTGC TTATATTCAA GAAGGTGACG AAATTAAAGA 4 0260 

AATGGAAGTA CACGGACATA ATTTAGCCGA AGGTGCGCCA CGTTCATTTA CTTTAACATC 4 0320 

ACGTGATGTC CTAGAAGCCA TTCAACAACC ATTAAACGGT ATTGTAGCAG CGGTTCGAAC 4 0380 

GGCACTTGAA GAATGCCAAC CAGAACACGC AGCAGACATT TTCGAACGTG GTATGGTATT 4 0440 

AACTGGTGGC GGTGCTTTAT TACGTAATAT TGATATTTTA CTTTCTAAAG AATCTGGTGT 40500 

CCCTGTTATC ATCGCAGAAG ATCCATTAAC CTGTGTTGCT CGTGGTGGTG GCGAAGCATT 4 0560 

AGAAATGATC GATATGCACG GCGGCGATAT TTTTAGTGAT GAAATTTAAT CAGAATATGA 4 0620 

ACAAAAAGTG CGGTTAAAAA TAACCGCACT TTTCCTTTCA CTTTTTTCAT TCAACAAGAG 4 0680 

AAAGTTTAAT GAAACCCATT TTCGGCAAAG CCCCTCCCCT AGGTATTCGT CTGCTTATCG 4 0 740 

CTATCTTTGC ATCTATTGCC TTAATTTTAG CTGATGGACG TAATAGCTCA ATAACGAAAG 4 0800 

CACGTAGnnn AATGGAAACG GCAATAGGCG GTTTATATTA TCTTG CAAAT AGCCCTAGAA 4 086 0 

GTATCTTAGA TGGCATTTCA GAAAATCTTG TTGATACCAA TAAATTACAA ATTGAAAACA 4 0 920 

AAGTACTACG CCAACAGCTT TTGGAAAAAA ATGCAGATTT ACTTTTACTT GATCAACTCA 4 0 980 

AAGTCGAAAA TCAGCGATTA CGTCTTTTAT TAAATTCTCC GTTAAGAACC GATGAATACA 41040 

AAAAAATTGC CGAAGTTCTT ACTGCAGAAA CAGATATTTA TCGCCAACAA ATTGTTATTA 41100 

ATCAAGG CAA AAAAGACGGT GCTTATGTAG GTCAGCCTGT CATTGATGAA AAAGGCGTCA 4116 0 

TTGGACAAAT TATTTCCGTG GGCGAAAATA CTAGCCGAGT ACTGCTACTC ACAGACGTAA 412 2 0 

CACACTCAAT TCCAGTGCAA GTTCTACGCA ATGATGTTCG TGTTATTGCG AGTGGGACTG 412 8 0 

GGCATACCGA TGAACTAAGC CTCGATAACG TGCCTCGTTC CGTTGATATA GAAAAAGGCG 413 4 0 

ATCTTCTCGT AACCTCTGGG CTTGGTGGAC GTTTTCTTGA AGGCTATCCC GTAG CTGTTG 414 0 0 
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TACAAAGCGT TTCACGTGAT AGCTCTAATT ACTTTGCAAT AGTTAAAGCT AAACCTTTAG 414 6 0 

CTTTATTAGA ACGTTTACGT TATGTACTTT TACTTTGGCC AAGTAACCTA GATATATCAA 41520 

AAGTAAAATC AATATCGCCA GAAGAAGTTC GCAATCTGGT GCAAAAACGT TTAGAAAGCC 41580 

AAGCTAATGA AG CAAATT AT AGAGTGAAAA AAACACCCTT AACAGAAAAA GACTTTCTAG 416 40 

AAAATGAAGA AAAAAC C ATT ATTGAACCAG ATATTCCAAC AGATTTTCTT CAAAATGAAG 4170 0 

AAGATCTTCC CATAGAACAG CAAGAAGAAA GGATAGAAGA CTAATGCAAA CACGATTTAT 41760 

TTTACAATGG TTCACTATTT TAAGCTTCTT TGTTATTGCC TTTGTATTAG AACTTGCACC 418 2 0 

TTGGCCAGTT GGTTTC CAAA TGCTAAAGCC TGCGTGGCTT GTACTTGTAT TACTTTATTG 41880 

GGTATTAGCT ATTCCCAATA AAGTAAGTAT AGGCTGGTCA TTTTTACTCG GATTAACTTG 41940 

GGATTTAATA TTAGGCTCTA CATTAGGCGT TCACGCTCTT GTACTTTCTA CGAGCATGTA 42 000 

CAT CATTG C C AAAAACTATT TAATTCTGCG AAATTTATCA CTTTGGTTCC AAAGTTTATT 4 2 060 

GGTTGTACTT TTCGTCTTCA TTATAAGATT ACTTATTTTC TTAGTTGAAT TTTCACTCCA 42120 

TACTGCCTTT TTTCATTGGC AAGCCATCTT GGGCGCATTC GCTTCTGGTT TACTATGGCC 42180 

GTGGGTATTT TTATTAATGC GCAAAATACG CAGAAAAGTT AAATTACATT AAAAAAGAGA 422 4 0 

GATAACTCTC TCTTTTTTAT GATTTATGAA TGTTTCAAAC ACCTAATTCA ATGGCTTATT 42 3 00 

GCTATCCAGT AAAGAATCAC ATTCCCAACC GATATACTCT CCCGAAAATT GTTCAGCCAA 423 6 0 

TTTTTCAAAA CGTTCTACTT GTTCAAAAAT TTCGCTATTT TCTAATGAAA TTTCTTGCTC 4 2420 

TAATTTT AC C AAGAAATAAG GTTCGTCATC TTCATTAAAC TGAATCGCTT GTGCAGAATA 4 2480 

TTGTAATGTC GTGAAAGAAT AATCCCCTAA ACTAAGCTCA TCCATAAAAG GGAACATTTT 42 540 

CTGCTCTTCA TCAAAATGAA AACTATGCTC TACTAAATAG GTATCACTTA AATTACGACC 4 2600 

TTTACTTTTT AATAAATCAA GCAATTCTTC CGTTGCATTT AACTTAATCT C C AATGGAG A 4 2660 

AG CG AG CAAA AAGTCAAAAT AAGTATCCCA AATTAAATCC TTTTGCACAT TAATATCTTT 4 272 0 

AACAAAATCA ATTTGTTGTA AAGTTTCTAC AATAAGATTT TCATGCTTAC AATAAAAATG 42 780 

TATTTTTATT T G AG AATT AC AAATAAAATG CCCCGCAAAA AATACATTTG GAAGTGCGGT 4 2840 

TAGTTGTGTC AAAATTTTAA ATACACGGTT AATTAACTTA TCGTATTCAA CTTCTGAGGG 4 2 900 

TAAACCCGTT TCATCATCAG CCTGATAATC AATAGTAAAC TGCACTATTT TAT CG AG AC C 4 2 960 

TTTACCGTGA TATTTTTCAA CATCTTCAAT ATTTGCGGTA AACACGGCAG GCATACCATT 4 3 020 

CACAGTAGAA CGATAATTCT GCCAATTTGC CATATCCATT TCGCTTTCCT ATTTAAATTC 4 3 080 
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AGCCCAAATC GGTGCGTGAT CAGACGGTTT TTCCATTGCT CGAATATCTA ATGCAATGCC 4 314 0 

CACATCGACA CATCGTTCAG CTAATTTTTG GCTTACCAAA ATATGATCAA TTCGTAAACC 4 3200 

TCGATTATCA TCAAAACCTT TAGAACGGTA ATCAAACCAC GAGAATTTAT CATTTGCTGT 4 3260 

TGGGTTC AG C TTACGAAAAC TGTCTTCTAA TCCGTAATCA TACAAACGTT GATACCACGC 4 3 320 

TCGTTCTTCC GGTAAGAAAG AACATTTCCC TGTGCGTAAC CAACGTTTAC GATTTTCATC 4 33 80 

TCCGATACCA ATATCTAAAT CACTTGGGCT AATATTCATA TCGCCCATAA TTAAAATCGG 4 3440 

ATTCGATTTG TCGTGTTCTT TTTCTAAATA CTGTTGAAGA TCTGCATAAA ATTTCTCTTT 4 3500 

AGCTGGAAAT TTAGTCTCGT GCGCACGGCT TTCCCCCTGT GGGAAATAGC CATTAATGAC 4 3 560 

AGTCAATAAG CCAAATTCAG TTTCCAGATC AGC CAT AATA ATGCGTTTTT GTGCATCTTC 4 3620 

ATTGTCAGTT GGAAAAC CAC GACGAATCAC TTTAGGTTCT TGTTTAGTCA ATAATGCCAC 43 68 0 

GCCATAATGA CCTTTCTGAC CGTGATGAAA TACGTGATAA C CTAAGTTTT CTGTAATTTC 4 3 740 

ATAGGGAAAA GCCTCGTCAG CAACTTTAAT TTC CTGTAAG CCAATCACAT CTGGCTGATA 4 38 00 

TTTTTCAATA ATGGCTTCAA GTTGATGTGG GCGAGCACGT AACCCATTTA TATTGAAAGA 4 3860 

AATAAATTTC ATTTTTGTTC CTATTAAAGA GTGAAATTGG CTTTATTATA CAGAGATTTC 4 J 920 

TATTTTGATA AAAACGGCTT TGTCAGATAA ACCTGAATTT GCTCAAATAC TGTAGG CTTA 4 3980 

ATC CAAAATT GACCTTGAGA AGGTAAACAA GTGACCGAAA TAAAATTATT TAGATAATCA 4 4040 

AATTCCAATT GAGTAGCGTG CAAATAAGTG CGGTCAATTT TTTCTGTGTT TTTACCATAA 4410 0 

AGCCCATCGC CTAAAATCGG GCTACCTAAA CTTTTCATCG CAACACGTAA TTGATGCGTT 4416 0 

TTCCCCGTTT GTGGTTTTAA AATAAATAAA CGTAAATTAG GCTCACAACT TACACTTTCA 4422 0 

AAACGCGTAA TTGCGGGATT GTCTTTTGTT TGGCAAAGTT TCCATGCGCC TTCTCGAGCT 442 80 

TTCTTCATAT CCCCGATAAT CAAACCTTGT TTCTTTTTAG GCTTTTGGTT ACTTAATGCC 44 34 0 

AAATAAGTTT TATGAATTTT ATGTTCAGAA AAAAGTCGTG AAAATTCAGC CGCACTTTCA 444 00 

GCATTTAACG CCAGAnTTAA CAAACCCGAA GTTACTTTAT CTAAGCGATG AACCAACCAT 4 4460 

ACTTTAGGCA CATTCAGTTG TTTTGCAACA AGTTCTGTTA AGCCTTGTTC TTCTTGATCT 4 4 520 

TTATGAACGC TAATACCCTC AGACTTGTTG ATAATAATAA AATCGCTATG TTGATAAACA 44 58 0 

ACCTCGATTT CCATCAGATT TGCTCCTTTT TGTAAATTTT ATTTTTACCG AGAACAATAT 4 4640 

CTTTTACCGA ACTAAGTAAT TCCTGTTGAA TATGTGACAG TTTTGGTTTT TCGTTTAATT 44 700 

GAAATGCTAA CGGACGACAA AGTTCCATTG CTTTCACGCC TAATCT AG CA GTGAGTAATC 4 4 760 
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CCACACCAAT TCCTTGAGCG ATACGTGTAG ATAATTTTGC TGTCACATCT TGAGAAAGCC 4 4 820 

AATCCATTCC AAT AT CTTGT GCCACTTCGG TTGCTCCAGC AAAAG CG AT A TTGACCAATA 4488 0 

CCATTCTTAA CAATCTAATA CGTGnAAAAT ACCCTAATTC AATCCCATAA ATTTCAGCAA 4 4 940 

TCTTATTCAT TAATCGAAGA TTGCGCCACG CAATnAAAAA CATATCCACT ACAGCAAGTG 4 5000 

GACTAATCGC GACAATCACA GCTGATTCAG CTGCCATTTT ACTAATCAAT TTTTTAGCTT 4 5060 

GTGCATCAAA AGAAGACAAA ACATGACGAC TGAATAAATG AGCAATTTCT TGCG CCGAAT 4 512 0 

AAGCTTCATT TAATTGGTGT TGCCATTGAA TTACCGTTGG AGAATCATTT TCCAAACCTA 4 5180 

AAGATTTTGC AATATCTAAG CAAAGAATTT TACTTTTTTC TGCGTTATGA ACGGAAAAAA 4 5240 

CATCACCATT TTTTACCGCA CTTTCTAACC AAATCTGCTG ACTTTGTTGT TGCCATTGCT 4 5300 

CACGTTTTTT TAAACGAACT AAACGTCGCC ACTCACAAAT AATCTCTTTA ATACCCAATA 4 536 0 

AAATGATAAT CAAACTGACT AAAGC AAAAG CAAGATAAAT CCATTGATGT TTTTGATAGC 4 5420 

TATCCCAAAT CCACTGCACG GATTGCGCAA CCGTCGCCAA GCCAAATAAA AGTGCAGTAA 4 5480 

ATTTTAATAA AGTTTTTCTA AAACTGGATT TTGGTTTCAA TGCTTGTTCA AGTTGAGCAT 4 5540 

CAAGTAACTC TCCGTCCAAC GCTTCATCTA GCTTTGCTTC CATATTATGG AAnTCTTGCT 4 5600 

TGGGCTGATA GTGTTCTTCC TCTACATTAA CTGAATGTTC AAAAATTTGT TTTTCCATTG 4 5660 

CCTATCCCAA AAAATAACCC CACTAAATAG TGGGGTTATA AAGTCAAAAA TCAATCATTA 4 572 0 

TTCATCACTT GnAAAGATAG AAAGTAATCT TAATAAACTC AAGAATAAAT TGTAGATTGA 4 5 780 

AACATAGATG TTCACGGTTG CGCGAATGTA ATTTGTTTCT CCACCGTGAA TAATATTACT 4 5 840 

GGTTTCATAC AAAATGGTCA TTGTTGAGAA CACTACAAAT AATGCACTAA TTGCGACCGA 4 5 900 

TAAAGCAGGA ATTTGGAAGA AGAAACTCGC CACCATACCA AGCAATAACA CAATAAATAA 4 5 960 

TGCGAACATT GCGCTTGATA AGAATGACAT ATCTTTTTTC GTAGTCAATA CATAAGCGGA 46 020 

ACACGCAAAA AATACGATAG CAGTTCCTGC GAATGCAAGC ATAATCAAAT CTTCCATTCC 46080 

ACGTGCCACA TACATATTTA AAATTGGCCC GATGGTGTAT CCCATAAATC CAGTGAAAGC 46140 

AAAGGCCGCC AAAATACCAG CAGGGCGATC AGCTAATTTA TTGGTTAAGA AAAGTAAGCC 4 6200 

ATAAAAACCC ACTAGTAACA CAATAATATT TGGATAAGGC AAGTTTAAGC TCATTGAAAT 4 6 260 

AAACGCAACA AC CG CTG AAA AAGCCATTGT TAATCCCAGT AAAAAATACG TGTTACGCAA 4 6 320 

CACTTTATGT GTGCTGAGTA ACGATTCTTT TTGAGCATCC ACAATAAGAC GTGATTGCAT 4 6 380 

AAAAGCTCCT TGTAAAATAA ATAAAAAGTC TTCTGTAAAG TAAGGGCTAG GGATAAAAAA 4 644 0 
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GTCAATGGTA TTAATGGAAT TAATTTCATC AAAATGAATA AATTATCATC ATCTAAAGTT 4 6 500 

AAATCATCAA ATTT CTATTT ACATCAATAA AATATTCATT ATACTACGCG CATCAATCAC 4 6560 

GGAGGAATGG TCGAGTGGTT GAAGGCACCG GTCTTGAAAA CCGGCGAGGG TTTACG C CCT 4 6 620 

CCTAGGGTTC GAATCCCTAT TCCTCCGCCA TCAGAATTAA GAAATCCCTA AGTCACAAGG 4 668 0 

CTTAGGGATT TTAACTTTGT ATTACGTTAT TTGGTCAATC CATTCACTAG TGAAATAACA 4 6 740 

GTCTGAATAT CGCTATCAGA CAATTTTCCT TCTTTCACAA ATTTTACTTT TCCCGTTTTG 46 8 00 

TCTAACACGA TAATTGCACT ATCTTTAGAA TTTAATCCCC AAGCATTTTT TACCGCACTT 4 6 860 

TTATCATCTA ACACAACTTG GCTATGTGGA TTTTCCTGTT TGCCTTTTTT CGCACCATTT 4 6 920 

TTTACAAACA TACCTGTACC AACAATCGCA TCATCCGCAT TAATAATAGT GGTTGTTTGG 4 6 980 

TATTTTACAG GAT T AAAATG GGATGCTTTT ATCACATCAA TCATCGACTG ATTTTnCTCT 4 7 040 

TTTGCAGCCG TTCTACCCGC TAAATGATGA ACAACACGCA CTTTACCCGC TAATTCCGCA 4 7100 

GAACCCCAAG GTTGAAACAC AGTATCTTTT CCAGAAAGCA CAATTTCACC ATACTCACTC 4 7160 

ACTTTGACCG ATGGTAAAGA TTGTTCTAAT TGCAAATTGT GTGCCCAAGT TGAACTACTA 4 7220 

AATATCACGC CACAAACTAA GGCAAGAATT TGTTTTTTCA TACATTTTCC TCAAAAATTT 4 7280 

ATCAAAAACT TTCACGCTTT GGCGAAAGAC ATCTTTTTGC TATACTCGTG CGCCTATTAT 4 7340 

ATGTAATCAA GAAGGAAAAC CCAATGGATC AAATGCCTAA TTGCCCAAAA TGTCAAAGTG 4 7400 

AATACGTTTA CCACGATTCA ATTAATTTTG TTTGCCCTGA TTGCGGTAAT GAATGGGATA 4 7460 

ATAATGAAAT TCAAGAATCA GATGACGATC AACTAATCGT AAAAGACAGC AATGGGAATT 4 7 520 

TGCTTGCAGA AGGCGACAAT GTACTTTTGA TTAAAGATTT AAAATTAAAA GGTTCATCAG 4 7 580 

AAGTATTGAA AAAAGGTACA AAATTCAAAA ACATCCGCTT AGTAAATGGC GATCATAATG 4 7640 

TCGATTGCGG CAAAATCATG TTGAAATCTG AGTTTTTGAA AAAGGCTTAA TCGAAAAAAC 4 770 0 

GCGGTCTAAA TAGACCGCAC TTTG C ACTTT AAAGAAATAA ATTT ATTT AA TAATATCAAT 4 7 760 

CACTTCTCTG ACTAATCGTC CAATTTCATC CCAATTATTG GATTGAATCA ATTTTTTCTC 4 7820 

CACAAACCAA GAGCCTCCAC AAGCAACAAT ATTAGGAATA GCAAGATAAT CCCTTATGTT 4 7880 

ATGAAGCCCT AT AC C AC C AG TCGGCATTAT CTGTAATTGA G C AT AAGG AC CAAGCAATGC 4 7 940 

TTTAATCATT TTCACTCCGC CTGAAGCCTC TGCAGGGAAA AATTTTACCG CCGAAATTCC 4 8000 

CATTTCTAAT GCAATTTCAA TTGCCATAGG ATTATTCACC CCTGGTGTTA TTGGAAAATT 4 8 060 

CAAATCTTGA CATAATTTCA CAATTTTTGG ATTTAACCCA GGAGTAACCA CAAAATCTGC 4 812 0 
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TCCGCTACTT TTAGCTAAAA CTACTTGTTC CGCAGTTAAA ACTGTGCCCG C AG C AATTAA 4 818 0 

AAAGTCAGGT CTATTTGCAC GCAACAAACG GATCGCATCT GCCGCTGCTT CAGAGCGAAA 4 824 0 

AGTAATCTCT GC CAC AG ATA AACCATTTTT AGCCAACGTA TCTGCTAGCG GCAAAATATC 483 0 0 

GTCTGCATTA TCCAACGCAA TTACAGGCAC AATTTTG AG T TCACGAAGTT TTTCAATAAT 483 6 0 

TTGTTGAGTT GTGTATGACA TTTTTTTAAT TTGCTCCTAT TAGACTCCAC TATACGCAGA 4 8420 

AAAACCG CCA TCAACAGGTA AAACCACGCC ATTGACAAAA GCAGAATAAT TTTCATCTAT 4 8480 

TAAGAAAAGT AATGCACCGA GCAATTCTTC TGACTCACCA AAACGTCCCA TTGGTGTGTT 4 8 540 

AGTGAGGATT TTATTTGCTC TATCTGTTGG TTTTCCTTCA GTATCAAATA ATAAAGCGAG 4 8600 

GTTTTGATTA CTAACTAAAA ATCCTGGTGC GATAGCATTA CAACGAATAC CAACTTTCGA 4 8660 

GAAATATACA GCAAGCCATT GAGTAAAATT ACTGATTG C A GCTTTTGCAC CAGAGTAAGC 4 8 720 

AGGAATCTTT GTGAGTGGCG TAAAGG C ATT CATACTGGAA ATATTAATAA TATTTGCCCC 4878 0 

TTGTTTACCA AG C ATATCTT TTGCAAAAAC TTGGGTTGGT AATAATGAAC CTAAGTAATT 4 8 840 

GAGATTAAAT ACAAATTCTA T AC C AGATTT ATCTAAATCA AAGAATGTTC TTGTCGTTTC 4 8 900 

ATTTAAATCA AATTGATGGA ACTCATTATC TGTTGTCGCT TTTGGATTAT TCCCACCAGC 4 8 960 

ACCATTAATT AAAAT AT CAC AAGTGC CAAA ATCGGTTTCA ATTTGATTTC TCACTTCTTT 4 9020 

AATATTTTCA AGTTCTAACA CATTAGTTTT AT AGG CTTTT GCTTTTCCTC CTGATTGGTT 4 9080 

AATTTCCTTT GCTACTTTGT CTGCAGCCTC GAAATTCAAA TCTAACAAGG CAATATTAGC 4 9140 

TTTAGTATAC GCTAATTGCT TAGCGAGAAA TGAACATAAC ACTCCGCCAG CCCCCGTGAT 4 9200 

TATAATTAGT TTATTTTCTA AGTTATGGTT TGCTGCAATA TTCATAGTAA ACTCCATTAA 4 9260 

TTAAATTCAG ATTGAAGATG AGAAATTGCG ACTTTATCTA TAATCGCACC TTTATGTTGA 4 9320 

ATG AC GAT AC C AG C AATACG ATTGCCTTGC TGACAACAAA TTTCGAGAGA TTTATTTCGT 4 9380 

AAATAACCGT TCAGAAATCC AGCATTAAAA GAATCACCAG CCGAGGTTGT ATCCACTACA 4 9440 

TTTAAAATAG GTTCAGGTAT CACTTGCCCA TATTGAGATA AATACCTATC TGAAAAAATA 4 9 500 

GCACCATTCT TACCGCACTT CACAATCACT TTTGGTATAC CAATTTTATG TAATCGCTCT 4 9 560 

AAAGTATCCC TAGATGTTTT ATCTTTCCAT AATGCTTGTT CAT CATC AAA AGTCACCAAC 4 9620 

G CT AT ATT G A CACTAGGCAA AAGCTGCAAA TAACAATCTT GTGCTTCTTC TAATGAATCC 4 9680 

CAAAGTTTTG GGCGATAATT ACTATCAAAA ACAATTTCTG TGCCTTTTTT AGCTAAAGAA 4 9740 

GATAATTGTT CAATTAAAAA AGTGCGGTCA TTTTTAGGTA AGATTGCTAA TGAAATACCA 4 98 00 
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CTTAAATAAA TCACATCAAC TTGCTGAAGT TCTGCAATAA CCTTAGCAAA ATCCGGATGT 4 9860 
TG AAC CAT AT AACGTGCTGC AGATTGATTA CGCCAATAAA GGAACGTGCG TTCGCCTTGA 4 3920 
GCATCTAATT GGATTAGATA AAGACCTGGT TGGTG CTGTT CATCCTGTAA AACCCAATTT 4 9980 
GTTTGGATGC CATCAGCTTG CCAATATTTA AGCATTTGCT TGCTTAAATT ATCGGTTCCT 5 004 0 

AATGCAGAGA CATAATGAAC TTGAATCTCT TTTGATGAAC TTACTCGACT AAGATAAGTT 50100 

GCAGAATTTA AGGTATCTCC ACCATAACTT TGCCACATTT CCGCAAAGGG TTTCCCATTG 50160 

AGTTCAATCA TACACTCGCC GATAAATGCT ATTTTTTTCA TGATGTAACT TC CTAAT AAA 50220 

TCAAGATAAC CAATTCATTG GAACAATTAC AATTTCAGGA AAAAATACGA TCAACATCAT S0280 

AATAATTGCG TACACAAATA AG AATGG AG C AATTC CTTTG GATAATGCTC CTATCCCTAA 50340 

TTTTGATATC CCTGAGCCTA CATACAAAAC TGTTCCCACA GGCGGTGTTA TTAATCCTAT 504 00 

AGATAAATTA ATCACCATCA TGATTCCAAA ATACG CAATA TCAATATTAT ATGCTCTTAA 50460 

CACTGGTAAT AAGACTGGAA CAAAGATTAA TACCGCAGGA ATTAAATCCA TTACACATCC 5 OS 20 

AACAAGTAAT AAAAATAGCA TTATAATAAA CATTAAAATT GTCGGACTAT CAGTTAATCC S0580 

TTTAATCGTT TGGACTAATT CTGTTGGAAT TTGAGCAACT GTAATCGCAA AAGCAGATAT 50640 

CATTGCTGAT GCAGCAACAA ACATTACCAT TGATGTTGTT TTGATGGTAC CAATGAAGAC 50700 

GTCTTTTAAT TTAGAAAATG TTAAGCCTTT ATAAGCAATA CTTACTATGC CAGCATAAAT 50760 

TGCAGCAACA ACTCCCGCTT CTGTTGGAGT AAAAATCCCT CCTCTTAATC CCACAATAAT 5 0820 

AATGATAGGT AATAATAAAG GCCAAAATGC TTTTTTAAAA GCAACCCATC TTTCTTTGCC S0880 

AGTTTGTTTT CTTTCTAGAC TTGTATCATT ATTTCTATAC AAAATTTTCC AAGTAACCCA 5 094 0 

TAATCCAACG ACCATTAATA AGCCTGGTAC AGTTCCCCCC ATAAATAATT TAGTGATGGA 51000 

TGCTCCAGCA GTAATACCGT AAATGATCAT AGGAATGCTT GGAGGAATAA CAACTGAAAT 51060 

AATACCTG C A GCACAAATTA ATCCTGTGGA ACGAGCACCA TCATATTTTT TAGAAATCAT 5112 0 

CATAGGAATT AGTATTGCAC CTAACGCGGC CGTATCCGCT ACCGCAGAAC CAGATAACCC 51180 

TGCAAATATC AACCCAGAAA TGATCGCAAC ATAACCTAAA CCACCTTTAA TATGACCAAC 51240 

CATACTTGTT GCAAAATTAA TGATTCGCTC AGAAATTCCA CCATGCTTCA TTATTTCTCC 51300 

AGTAAGCATA AAGAAAGGAA TAGCCATCAA AGGnAAATTA TTCGTTCCCA TCACAAAGTT 51360 

TTCTGTAATT AATTGCGTAT CAAACATATC AAGATGATAT AGCATAATAC TCGCACTCAA S142 0 

CAGCATTGAT AGTGCTACAG GTACACCAAG AAATAACAAT AAGAATAAAC TACACAAAAT 514 80 

-77.32- 

SUBSTJTUTE SHEET (RULE 26) 



BNSDOCIO <WO. _ 963327 6*1. 1.> 



51600 
5X660 



WO 96/33276 PCI7US96/05320 

AATAGTAACA TTATAAGCTC CTATTTATCA GAGGATTCTT TATAGTCTTT AATCATGCTA 5154 0 
AACATGACAA TAAAAAAGTA AGATACGGCA GATATAACAG CAGCTAAATA TAAAAAAGAT 
GnAGATATAC CTGTAGCTGG CATTTGTTGA GTATAGGTCA AAGACATCAG TTGTAATGCA 
CCATAAGCTA TAAAAATCAT TGCAATCAGA ACGCATATAT TTGCAACTAT TAAGACTATT 5172 0 
TTTCTATACT GCTCAGGCAA CGCAGGAATA ATAATATCAA CAGTAAGATG AGCTTTATCC 51780 
TTGGCnACAA GTATAATCCC AAAGnAAATC ATATATACAA AACAAATCCT AGAAAACTCT 
TCAGAAAATG CAATTCCAGA ATCAAAGAAG TAC CTCAAG A CAGAATTAAA AAACACGAGA 
GAAATCATAA TAACTAATGC CAAGATAGAA AGATATTCTA ATGCTTTGTC TAAATACTTT 
GCTATTTTCA TAAAACAAGC TCCTTCTTTA TCCCCCATGA GCAATTTATA AAATTGAACA 
TAGGGGATCT TATTAATTAT TTAGCTTCGT TAATTTTTTG GACTACATCC TTAGCCCAAT 
CATATTTTTT AT AAAAACT A TCATAAAGCA CCTTAGATGC ATTCACCATG TCATTTTTAA 
ATGCTTCACT TGGTACGATA ATTTCATTTC CATGTTTAGC TAGATAATCT TTAACTTCTA 
ATTCTGTTTT AATTGCTAAT TCCCACTCTT TCTCTTGATA CAATTTAGCT GCTTTTTCAA 
AAATAATTTG ATCTTCTTTT GGAATTTTAT TCCAAGTTTT TAAGCTAATT TGTAATAATC 
CTGGAGAGAA AATATGGTTA CTTTGGATAA TATATTTTTG TACTTCATAT AACCCAGAGT 
CTTTAATTAA CATATACGGA TTTTCTTGAC CATCAATTAC CTTTTGTTCT AATGCAGTAA 
AAACTTCACT GAGCGGCATG CTTTGTGGAT TTATATTCAT TGCTTTAGCC CAATCAACAA 
AAAGTGAATT CAGCGGAACT CTTAAACGTA AACCTTTAAA ATCGTCAAAT TTCTCTAATT 
TTTTATTAGA TGAAATAACA CGAAATCCAT T CAT ACC AT A GGCAAGTAAT TTAATTCCAT 
ACTTCTCCAT TTTTTGACCA ATTTTATTTG CGATTGGAGA TTC C AAT ACT TTTTTAGCCT 
CTTGATTATC CTTAAATAAA TAAGGCCATT CCCAAACTCC AAAAATAGGA TCTACATTTT 
GCAATAATAA ACCTGTAATA CCTGCTTCTA TAGTGCCATT ACGTAATCCA TTTACAATTT 
GATCTTCACC GCCAAGTTGA TTATTAGGAT AAAGTCTAAT CTCATATTTA TGATTTGTTT 
CATTTTCAAT TGTTTTCTTA AAATATTCCT GTAATGCAAT ATGTGATGGA TGAATATCAG 
AATTATAATG TCCTAACTTA ATAATTGTTT TTG C AAATGC ACTGCTACTA AACAATATAC 
TTGAAGCAAG TAATAATTTA CCTAAGCCTT TAATTGATTT CATAACTTTC TCCTTATTGA 
TTATGTTTTT TAGAAAACTG CATTAATTTT TAAATAATTT TCTGGATGAT CAGATAAATC 
AGAAATAAAA CTAGGAATTT CTTCAAATTT GATACACTTA GTAATGATAT GAGATGCATT 
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AATTTTATTA GTAGCAAGCA TATCCAATGC CTCCTCAAAT TCCCCCTTGC TTGTTCTAGA 5 3220 
ACCATAAATA TTCAATTCTT TAAAGGTAAT TAAATTTGTT GGTAATGGAG TTTCTGTTTT 5 3 280 

TGGCCATCCT GTTAAAGCAA TTCTTCCTGC AAAGGAGGCA TAATGCAGAG TATTTTtGAT 5 3 340 

AGAAATATTA GCACCAGATG CCTCCATAAC AACTTCAGCC ATACGACCAC TTGTAATTTC 5 3400 

CTTAATTCTT TTAATGTCAT CCTCCTTATG CGGATTTACA ATATGTTCAA TTCCCAAGGA 53460 

T TTTG CAT AC TCAAGACGTT GTTCTAATAT ATCAACTAAA ATTGGGATTG CTCCATATTG 5 3 520 

AACCGCTGCT AATGCAGCCA TTAATCCAAT TGCACCAGCT C CAATAATT A CTATGTGTTC 5 3 580 

CCCTGATTTT AAAGTTGTTC TATGTAAAGC GTGTAAAGCT ATTGTTAAAG GTTCAGCTAA 5 3 640 

AGGTAATTGA TGAATCGGTA GGTTATCTGG GACTTTAGTT AAAAGATGAG CGGGATGTCT 53 7 00 

AATTACCTCT TGCATACCTC CATCAATATG AACACCAATT AC CTTTAATG ATTCACAACA 53 76 0 

ATTGGTTCTA CCAATTGAAC AAGGATAGCA TTGACCACAA CAAACATAAG GATCAACAAT 53 82 0 

TACTCGGTCT CCAACCCTTA CCCCATCTGA CATACCAATT CCACTTTCAA TTACTCTCCC 53 880 

TACAATTTCA TGCCCAAGTA TTCTTGGATA AGTTACTAAT GGATTAGTTC CTCTATATGC 5 3 940 

AC CAATATCA GAACCACAAA TCCCCATTGA TTCTACCTGA ATAAGTACAT CATTATCACT 54 000 

TTTCTCGGGA TAAGGCACTT CTTTAACAAC TACATTGTTT GGTTTTTCTA AACAT ATTG C 54 06 0 

TTTGATTCGC TCCTTCATTT TTAGACTCCT TATATG C CAT TGACAGAATA AAATTACTTG 54120 

TTATGTTTAA CTATGATATG ATTATGAAAC ATAAAAACTC TTTTTGGAAT AC CAAAT ATC 5418 0 

TATTTTTGTG AG C C ATATC A CAAAAATAAA AAATTGGAAT ACCAATTTAA GGAAAAATAT 54240 

GGAAAATATA GTAAATAGAA CATATACCCG AATAGGTCAA TTGCTAAAAC AAGATATTTC 54 3 00 

TCAAGGAATT T ATAG C ATTG GAGACAAACT CCCTACAGAA CGAGAAATTT CTGAAAAATT 54 360 

TGGTGTAAGT AGAACTATTG TTAGAGAAGC AATGGTTATG TTAGAAGTAG AAAAGTTAGT 544 2 0 

AGAGGTAAAA AAAGGGTCAG GCGTTTATGT AGTAAGAACC CCCGAATCAA TACATATGGA 54480 

ACATTCAGAT TTACCAGATG TAGGTCCATT TGAACTTTTA CAAGCTAGAC AACTATTAGA 54 54 0 

AAGTAGTATT GCTGAATTCG CAGCGTTACA AGCGACTAAA AAAGATATTT TAAATTTAAA 546 0 0 

ACAAATACTG AAT CG AG AG A AAGAATTACT TACTCAGAAT CAAGATGACT ATAGTGCAGA 54 66 0 

CAAAGATTTT CATTTGGCAC TTGCAGAAAT TACACAAAAT GATGTGTTAG TCAAATTACA 54 72 0 

AGAACAATTG TGGCAATATC GCTTTAATAG TGCGATGTGG GCACAATTAC ATTCTCGTAT 54 78 0 

TTTACAAAAT GATTATCATC ATTTATGGAT AGAAGATCAT CAAACAATTT TATCTGCTAT 54 840 
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TCAAAAGAAA AATGCTAATG AAGCTAGAAA AGCTATGTGG CAACACTTAG AAAATGTAAA 54 90 0 

AGTAAAATTA TTTGAATTAT CTGATGTAGA AGATCCACAT TTTGATGGAT ACCTATTCAA 54 960 

T AC AAATC C A GTTGTTGTAG GTATCTAAAT ACTTGTAATA ACATAGGAGT TAAAAATGGA 5 5020 

ACAAG CATGG CGTTGGTATG GACCAAAAGA TCCTGTTTCT TTATCTGATA TTCGTCAAGC 5 5080 

AGGCGCAACT GGTATAGTCA CTGCACTTCA CCATATTCCT AACGGTGAAA TATGGgGTAT 5 514 0 

TGAAGAAATT AAAAAACGCA AAACTGAAAT AGAAAATGCT GGTTTAAGCT GGTCAGTAGT 5 5200 

GGAAAGTGTA CCAGTTCATG AAGAGATTAA AACT C AG ACT GGAAATTATC AAAAATGGAT 5 5260 

TAATAACTAC AAACAAACTC TTCGCAATCT TGCTCAATGT GGAATTGACA CTGTTTGCTA 5 5320 

CAATTTTATG C C AG TTTTAG ACTGGACTCG AACCGATCTT GCTTATGAnT TACCAGATGG 5 5380 

TTCTAAAGCA TTGCGCTTTG ATCATATCGC TTTTGCCGCA TTTGAACTTC ATATTTTAAA 5 5440 

ACGTC CTGAT GCAGAAAAAA C CT AT AATCA AGAAGAACAA GTTGCAGCAA GAACCTATTA 5 5 500 

TGACAATATG TCAGAGCAAG ATATAGCGCA ATTAACGCGT AATATTATTG CGGGCTTACC 5 5 560 

CGGTGCAGAA GAAGGTTACA CTCTTGATGA ATTTCAAACT CAATTAGATC GCTATAAAGA 5 5620 

TATTTCATCC GAAAAATTCC GTACGCATTT GGCTTATTTT TTAAATGAGA TCGTACCTGT 55680 

TGC CCAAGAA ATAGGAATCA AAATGGCGAT TCATCCAGAC GATCCTCCAC GCCCAATCTT 5 5740 

AGGCTTACCA CGCATAGTTT CAACTATTGA AGATATGCAA TGGTTTGTAG AAACTCAACC 5 58 00 

ATTACCAGCA AATGGTTTTA CAATGTGTAC GGGTTCTTAT GGCGTACGTt C CGATAATG A 5 5860 

TCTCGTAAAA ATGACTGAAC AATTCGCTGA TCGTATTTAC TTCGCTCATT TACGTTCTAC 5 5 920 

TCAACGTGAA GGTAATCCAC TAACCTTCCA TGAGGCAGCT CATCTAGAGG GAGACGTAGA 5 5980 

TATGTTTAAT GTAGTAAAAG CTTTATTAAA TGAAGAATAT CGTCGATTAA ATCAAGGAGA 56 040 

AACGCGTTTA ATCCCAATGC GCCCAGATCA TGGGCATCAA ATATTGGATG ACTTACGTAA 5610 0 

AAAAACCAAC CCTGGCTACT CTGCTATTGG TCGGTTAAAA GGTCTAGCTG AATTTCGCGG 56160 

TTTAGAGATG GCATTAAAAA AAGTGTATTT CAATAAGTAA AAGAAAAAGT GCGGTCATAA 5622 0 
AAAACATTAT TTTTATCGAC CGCACTTCTA ATTTATTTCA TCAATCTTCT CATTCGAATA 562B0 
TTGATCATCT CGACCACAAC AGAGAATCCC ATTG C AAAAT AGATGTAACC TTTCGGGATG 56 34 0 
TGAATATCCA AACTTTCTGC AATTAAACTA ATTCCGACTA AAACCAAGAA AGCTAAAGCT 5 64 00 
AGAATTTTCA ACGTTGGGTG TGTATC C AC A AAATCGCCGA TTGGTTTTGC CGCAAACATC 5646 0 
ATCACCCCCA CGGCAATCAT AATCGCCAAA AT CAT C AC AG GAAGGTGGCT TGCCATACCC 56 520 
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ACCGCTGTAA TGACAGAATC TAAAGaAAAT ACAATATCTA GCACAGCAAT TTGAATTAAT 56 580 

ACGCCAAGAT AACTCACTTT GTTTTTGCTT TCAGATTCAT GGTGTTCTTG ATGATTTATC 5o64 0 

GCCTCTTTAA TTTCGCCAGA ACTTTTTATA ATCAGGAATA AAC C AC CG AT TAATAAAATC 56700 

AAATCACGCC CAGAAATTTC TTG ATT AAA C ACAGTAAATA ACGGTGCAGT AAGTTTCATT 56 76 0 

ATCCACGCTA GtGaCATTAA AAGTAAAATT CTTGTTAGCA TTGCCAAAGC AAGCCCAAGA 56 82 0 

ATACGACCAG ACTGGCGTTG GCGTTCAGGT AAACGTCCAA CTaGGATGTT AATAAAAwTA 56880 

ATATTGTCGA TTCCTAAAAC GATCTCTAAA GCCGCAAGTG TCACAAGTGA TATCCACGCT 56 940 

TCAGGATCTG CAATCCATTC AAACATAATT TTTCCTATAA TTGAGTTAAA TAAATCCGAA 5 7 000 

AATACAGATC ATAATGGCTT TATGACGAAA AGCAATCGTT ATTTATATTT GAGAAATGAT 5 706 0 

TGATTAGTCG TTTTTAAGCG TTTCAAAAAT CCTTTCAGCA AGTTTAGGGG AAATGCCCGG 5 712 0 

TACAGAGGCT ATTTCATCTA AGGTCGCCTT TTTCACACCT TGTAAACCAC CAAGATATTT 5 718 0 

CAGTAAAGCC TGACGACGTT TAGCTCCCAC ACCTTCAATG GTTTCTAAAC CACTTTGTGT 5 7240 

AAAGGCTTTT TGACGTTTTT TACGGTGTCC GCTTATTGCG TGATTATGGC TTTCATCACG 573 00 

GATGTGTTGA ATCAAATGCA ATGCCAAACT ATCATCAGGC AAATGAATTT CGCGGTCTTG 5736 0 

TTTGCTGATG ATCAGTACTT CCTGTCCTGC TCGACGATCC ACACCTTTTG CCACGCCAAT 5 7420 

TAAGTGCGGT CTATTTTTAT CCCATTTTAC TTGAAGATGT TGGAATACAT TCAGCGCACG 5 7480 

ATTTAATTGC CCTTTTCCAC CATCAATAAA AATAATATCG GGAATTTTAT CTTCTTCTAA 5 7 540 

ATCACGCTCA TAACGTTTCT GCAACACTTG TTCCATCGCA GCATAATCAT CCCCGCCAGT 576 00 

AATGCCTTCA ATATTAAAAC GACGATAATC AG ATTTT AG T GGACCTTCTT TATTAAAAAC 57660 

AACACAAGAT GCTACTGTTT GATTTCCCAT CGTGTGGCTT ATATCGAAAC ATTCCATACG 5 7720 

CTTAATTTTA GGAAGATTTA ACAGATCACA CAACGCTTGA TAACGCTCTG ACATTCGTGA 5 7780 

AGATTGTTTA AGTTGTACAT TTAAAGCCGC TTT AG C ATT A ACTTGTGCGA G CTGTAAGT A 57840 

TTTACTTTTA TCGCCTTTTA CGCTTTCTTG AATCGTCACT TTTCTGCCCG CTTGCTCGCT 57900 

AAGAAGTTGC TCAAGCTCCG ATTTTT CAG C TAATTGACGA TCCACAATAA TGCTATTCGG 5 7 960 

GATACTTCGT CCTTGATGAC CTTGTAAATA AAATTGCCCC ACAAAAGTTT CGGTAAGTTC 5 8 020 

CGATAAATCC GTATTGGCGG GGACTTTTGG GAAATAACTG CGATTGCCTA GCACTTTTCC 5 808 0 

TTGACGAATA AACATC AC CT GTACACAGGC TAACCCGTGT TGGTAAGCGA TAGACATAAT 5 814 0 

ATCCATATCA TCTAAACGTT CATTGGATAC AAATTGTTTT TCTATAACCG AACGCACAGC 58200 
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CTGAATTTGA TCGCGATAAC GAGCAGCCTG TTCAAAATCT AAATTTCTGC TTGCCTGCTC 5 8260 

CATTTTACCG ATTAAATAAT CTAACACCTG TtGATCTTTG CCTTGTAAAA ATAAGCGAGC 5832 0 

TAATTCAACT TGCTGATTAT ATTCTTCATC GGAAACATAT CCCTGTACAC AAGGTGCAGA 58 380 

ACAACGCCCA ATTTGATACT GCAAACAAGG ACGAGAGCGG TTGGAATAAA CGGAATTTTC 5844 0 

ACATTGGCGA ACAGGGAAAA GTTTTTGCAA AAGCGATAAC GTTTCACGCA CGGCTCCCGC 58 50 0 

GTGAGGATAA GGCCCAAAAT ATTCACCTGC AAATTTTTTA GAACCACGAT AAGAAGTAAT 58 560 

TCTTTGGTGG CGTTCTTTGG TTAATAAAAT AAATGGATAG GATTTATCAT CACGCAATAA 58620 

TACGTTATAA CGAGGCTGAT AAAGCTTGAT AAAATTATGC TCTAGCAGTA AAGCTTCCGT 5868 0 

TTCAGATGAA GTTAGCGTGG TGTCAATATG ATGAATTGAC GCCACCAACG CCTCTGTTTT 58740 

TTTACTGCTC AGATTTTTAC GAAAATAGCT TGAAAGACGT TTCTTTAAAT CCTTTGCTTT 58800 

CCCAACATAA ATAACCTGAT CTTTATCGTC ATACATAcGA TAAACACCAG GTTCGTGAGA 5886 0 

GACATCAGTG AGGAATTTTT TAGCATCAAA CATTTTAATT CGCCGCTAAA ATTGCCCGCA 58 920 

CTTTTTCCAA ATCTTCGGCG GTATCCACCC CCACTGCAGG CACTTCTTTC GCAAGTTCTA 58 980 

CGTGAATACG TTCGCCGTTA TATAACACAC GAAGCTGCTC AAGTTTTTCT AGATTTTCAA 5 9040 

GTTGAGTCGG TGCCCATTGC ACATATTGTT TTATGAAACC CGCACGATAT GCGTAAATGC 59100 

CAATATGACG TAAGTAAGCG TCAGAAAGCT GTACTTTCTG AACATCCTGC AAATTCATAA 5 916 0 

ACTGATCACG AT CATAAGG A ATAACCGAAC GGGAAAAATA CAGCACATAG CCATCTTTAT 5 9220 

CTGTTAATAC TTTCACTGCA TTTGGATTAA ATAATTC CTC AGCATCGTGA ATTTTTACCG 5 9280 

CAAGGCTTGC CATATTGACA TTAAATTTTG CTAAATTATC TGCCACTTGT CGCACGATAA 5 934 0 

CGGGAGGAAT CAAAGGCTCA TCGCCTTGAA TATTGACAAT GATTTCATTG TCAGGAATAG 5 9400 

CTAATTTTTC TACAACTTCT GCTAAACGTT CTGTACCAGA ATTATGATTA ACCGAAGTCA 5 9460 

TACAGACTTC AGCACCAAAA CTTTTGGCAA CATCAGCCAC ATTTTCATTA TCGGTGGCAA 5 9 520 

TAATCACTCG GCTCGCCCCA GACTGCAGTG CTTTCTCAAA TACATGTTGA ATCATTGGCT 5 9580 

TACCTTTAAT ATCAGCAAGA GGTTTTCCTG G CAAACG ACT TGATGCAAAA CGAGCGGGGA 5 964 0 

TAATCACGGT AAATGACATA ATTTATTCCT TATTCTTTAA TATTTCAGCT TTATCGGCAA 5 97 00 

GCAATCAAAA CAACTGGGGA TTAATCCTTT CAGCCATTTC TGTATTGCTC CACAAGTTTG 5 976 0 

TCTATTTTGG CCCAAAAGTG CGGTAAATTT TCACGTTGTT TTTCAGCCTC AATAATCTCC 5 9820 

GCATCCACAG GGACATACCA CCAATTATCT TTAGCAAAAG ATTGGCATTT TACGGCATCT 5 9880 
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TTTTCCGTCA TAAAGAGCGG TTGATTTTCA GCAAGTTTTT CTAATTGAGA CGCTTCAAAA 5 9940 
TGTTGATGAT CTTGAAATGC TTGAGTTTGC TTTAACTGAA TC CCTAACTT TTCTAACATA SO 000 
GTAAAAAAAC GCTGTGGATT GCCAATCCCT GCGATGGCAA CACCAGATTG AAATTCATTT 6006 0 
AATTGGCGTT TTTCATTGGT TTTTAAATTA ATCGCGAAAT GAGGCACAAG ACGCATAACT 60120 
G CATCTG AAT ACTGATTTTT TCCACCATTA GTGATCACAA AATCGACAGA TTTTAATCGA 6018 0 
CTTGGTAATT CACGCAATGG ACCTGCTGGC AATACAAAAC CATTTCCCAA TGCGCGCTCA 6024 0 
GCGTCCATTA CGACAATTTC TAAATCACGT TGTAATTGAT AATGCTGCAA ACCATCATCA 603 00 
GAAATAATAA TATCGCACTC TGCTTGGCTT AAGAGTAATT CAATCGCTTG CTGGCGATTC 6036 0 

GGGGAAATCA CAACTGGCGC ATTAGTACGT TTAGCGATCA ATACAGGCTC ATCGCCACCT 6 0420 

TCAATTGGAT TTGTATTTTT AGTGACGAAT AACGGATAAG TTTTAGATTT ACTGCCGTAA 6 0480 

C C ACG AG AAA TCACACCTAC ACG CAGACCT CGTTTTTTTA ATTCTTCCAT AAGCCAAACA 60540 

ACCACAGGCG TTTTGCCATT TCCACCCACA GACAAATTTC C CACAATTAT CACTGGTTTT 6 0600 

GGTGCGCGAT AAGAAGACTT CAGCCCCAAA GAGAATAAGG CGCGACGAAG TTGGCTAATC 6 0660 

AACCAAAACA ATAAAGAAAA AGGCGATAAG AGCCAAATAA GTTTGGAGTT GGAATACCAG 6 0720 

AAGGG CAT AG AAAATCCTTT CTAATAAGTG CGGATAAAAT AACGCGAGTT TTTCAACTCG 60780 

CGTTTTCGAT AATCAATAAA TTAGCCAGTA AACTG CATAC TATGCAACTG TTTATAGGCA 6 0840 

CCATTTTGTT CAAGCAATGT TTTATGGTTG CCACGCTCAC GAATTTCTCC GTGATCAATC 6 09 00 

ACAAGAATTT CATCCGCGTT TTCAATAGTA GATAATCTAT GGGCAATCAC AACAACCGTG 6 096 0 

CGATCTTTCT TTAATTCCTC TAATGCAGAT TGGATTG CTC GTTCTGATTC CGTATCTAGT 61020 

GCAGATGTAG CTTCATCTAA AATTAATACT GGCGAATTAC GCAATAAAGC ACGAGCAAtC 61080 

GCTAAACGCT GACGTTGCCC ACCTGATAGG CTAGTG C CAT TTTCGCCAAT TACCGTATCA 6114 0 

AaAACTTGCG GTAATTTTTC GATAAACTCT AAAGCATAAG CCGCTTTTGC TGC AG CG ATA 612 00 

ATTTCTTCGC GAGAATACTT ATCTTGTGCT GCATAAGCAA TATTATTCGC AATAGTATCG 6126 0 

TTAAATAAAT GGACTTGTTG AGAAACCACA GCGCAGTTTT CACGTAAATT AGATAAACGA 613 20 

TAATCTTGGA TATTTACGCC ATCCAGTAAA ATTTCGCCTT GCTCAATATC GTAAAAACGT 61380 

GTCACTAAAT TAGCAATGGT TGATTTGCCC GATCCAGAAC GTCCCACTAG AGCTACGGTT 6144 0 

TTTCCAGCTG GAACGCTAAA AGAAATATTA TTTAATGCAA GTTCATCTTT TCCTTGATAT 61S00 

GCAAAACTCA CATTTTTAAA TTCTAACTCG CCTTTCGCAG GTTCTGCTTT ATAAGCCCCG 61560 
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TCATCTTTTT CTGGCTCTAA ATCTAAAATG GCAAATAGCG TTTGACATGC TGCCATTCCA 61620 

CTTTGAAATT GTGCATTCAC CGCAGTTAAT GATTTTAACG GACGCATCAT AGCTAACATT 61680 

GAAGAAAATA CCACTGTAAA TGAGCCTGCA CTCAAATTAT CTTCTGCAAT CAATGGAGTG 61740 

GTAGCTAAAT AGAGCACCGT AGCTAATGCG AGAGATG CAA TAACTTGCAC CACAGGATCA 61800 

GAAATGGAAT TTGCTGTCAC CATTTTCATG CTTTTTCGAC GCATATCATT ACTAACATGA 61860 

TTAAAATGCA CTTCTTCCAC ATGTTGCCCT CCGAAAGAAA GCACAACTTT ATGACCTTTT 61920 

AACATCTGTT CTGT AG C AG A CGTAAGTTCA CCCATTGAAT CTTGTAAGTT CTTACTTAAT 61980 

CTACGAAAAA TTTTTGATAC CAAACGAATT AAAACAGCAA TGATTGGACC TATAATAAAA 6 2040 

AG CACAATTG TTAATTCCCA ACTGGTATAA AACATCACTG CGAATAGCGA AATAATATAT 6 210 0 

GCTCCTTCTC GCACAATTGT AATCAAAGAT CCTGAAGAAG AACTTG CAAT CATTTGGGAA 6216 0 

TCATAAGTAA TCCGAGAAAG TAAACGCCCT GTTGAATTTT GATCAAAGAA ACTCACTGGC 62220 

ATAAACATTA AATGTTTAAA CAAGCGGCGA CGCATTGTCA TGACAACTTT GCCTGATACC 62280 

CACGCCAAGC AATAATTAGA AATAAAGTTG GTAATGCCGC GTAAAATAAT CATCCCAACG 62340 

ACGACAAAAG CCATCATTTT CAAAAATGAA TGGTTTGCCT TACCAAAACC ATCGTCCAAC 624 00 

AACGGTTTTA ACAAATAAaT CAAGCCTGAG TCAG CTAATG CATTAAAAAC TAACGCGACG 62460 

CCAGAAACAA TAAGCCCCGC TTTAAAAGGT TTAATCATTG GCCACAAACG CTTAAACGTT 62 520 

TGTAAGGTTG AAAAATCATT TTCTTGTAAT TTTTGCTCTT GCATAATCGC TATCTAATAG 6 2580 

TAAAAATTGC ACGCATTGTA CCTTTTATTC CTTTGATAAT C C AATT AC AC GCGCATACCA 62 640 

AGGGGAAAAT TTTGTGCGAG CTTGCTGGAT TTCTAATCGG TCTTGAAAAA AATTTACCCG 62 700 

CACTTGCCCC GAAACAGCGG TATTTTCTAC CGCACTTTTA TAG CG ATGAA GCCTTTCCAT 62 760 

AACCGAATAA TGAGGGAATT TCCACGGATT CCAACGCCCA CTAGAAATAA T CG CT AC AT C 622 2 0 

TGGTCTAACC TGAGAAAGCA AGTATTCACT TGTCGATGTT TTACTCCCAT GATGCCCCAC 6288 0 

TTGCAACACA TCGATTTTGC CTAAAGTGCG GGCAAAAATT TGCTCATTTT TTGCTTCAGC 62940 

ATCGCCAGTT AG C AAAACG C TATTCTTTCC ATCATCGACT AAAAT C AC AC AAGAATGGGA 6 3 000 

ATTATCAGCT CGTGTCACAA CGTTGTGAGG AGAAAGTATT TGAAAATGCA ACCCTTGCCA 6 3 060 

ATGCCAATCA CGCCCAGCAG TACAAAAAGT GCGGTAATTT TCCCCATAGT TTTTC CGTGA 6 312 0 

AGGGGTAATC AATTCCACAT TGGGATACGC CTTTAAAATT GTCGAAGCAC CACCTGCGTG 6 318 0 

ATCGTTATCG TCGTGGCTTA AAATCAATTT TTCCAAAACA ATCCCTTCTC TTTGTAAATA 63240 
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AGGCAAAATT TCCAACTCAG CCATACTTCC ACCTCGCCAA GAAGAACCCG TATCATAAAG 6 33 00 

AATCCCTTTA CCATTTTTCA CAATCAACGT AGCTAACCCC TGCCCCACAT CTAAAGTATC 6 3 3S0 

TACCTGCCAA GTTGGCTTGC TCAATTGTTT AAACAACAAA ATCGTAAAAC ACAGTAAGAT 6 3420 

AATGCCGAAA GAACATCGCA AAACGTTGAT TCGTTCATTT TTTAGCAAAG GCTTACTGAG 6 3480 

ATTTAATGTA AAAAATTTTG CTCGTTTTAT TTGCCAGTTT GATGATGAAA TCTCGGGTTC 6 3 540 

TCGGTAAATA TTCCAAATAA TTAACATAAA AATTCCTGCA CAAAGTGCGG TTAAACCCAA 6 3600 

TGCTAAATTA AATGAAACCG TGAGCCAATT CCCTTGAAAA ACAGAAATTA ACCCAGTAAT 6 3660 

TCCTTCGGCT AGCTTGTTTG CTAGTTGCCA AGAAAACATT GTGCCGTTAG TAAAAACGGC 63 720 

AAATAAAATT AATGGCACAA GCAAAAAACT ATAAATTGGA ACCGCCATAA AATTGGCTAA 63 78 0 

AAATCCACTC AACGATAAGC CATTAAATAG AAAAAGTTGC AAAGGTGTAA AAAAGAGCAA 6384 0 

TAACCCAAAT TGCAAATGAA ATAAACTAAA AATCCACCGC ACTTTTGGAG AAAAAGGGCG 63 900 

ATTTTTCCAT TGAAAAAGTG AAAAAGGCAC ATAACGATAC CAGAGGAGCA AACACCCAAC 63 960 

CGCTCCACAA GAAAGCCAAA AACTGACCGA AAGCGGCATT AATGG ATCG C AGAAAAGCAA 64 02 0 

GAATCCGACA ACCAACGTAA AAAACTGAAT GGGCGAATAA TGTCGCCTCA TTATTTGAAT 64080 

AAATAAAACG AAAACAAGTG CTGAAATGGC ACGAAAAGTT GGCACACTAA AACCAGCCAA 64140 

ATACGCATAA ATTAAAGCAA ATAAAACACC AAAAACTAAA GGAAAATAAG GATGAATAAA 64200 

ACGCGTAGGA AAGAACACTT GCACAACACG CGCCAAGcAA AATCCAATTC CCATAGCCAA 64260 

CCCAATATGT AAGCCAGAAA TAGCAATAAG ATGTGCGGTA TTGGTTTGTT GGTAAATTGA 64 32 0 

CCAAGTGGTT TTATCTAACC AAGCCCGTTC GCCAAAAGCT AAGGCAATCA ATAAACCTTG 6438 0 

TAGAGATAAT CCTTCCGTTT GCTTCTTTAC TTGTTGCAAT TTTTCTGCAC GCAATGATGA 64 44 0 

AACATCCGCA ATTTTCACCG CACTTTTTAC CGTTCCAACA GCCGTAATTC CTTTTGAAAA 64 500 

ATACCATTGT TGCCGATCAA ACCCACCGAA ATTTAATCGT GCAGAAAGGG AACGTAAAGA 64 56 0 

AATTTCAGCT TGCCAAATTT CCGATAATTG AGGCACCTCT TTCGCTTTCC AATTTAAGAA 64 62 0 

AATTCGTTGT TCTTGCAAAT TATTCTCCAA TGTTGCTGTG GCGATAAGCG TTTGATAATC 64680 

CTGTTGGTGC AAAATTTCTT GAATCTTAAA AGTTACCACT TG CTTTTG AG CGGTAATATT 64 740 

TTGAGCTTGT TGTGATAAAC TCAAAGCCGA ATAATGAAAA TAGCCAAGAG AGGCAACAAA 64800 

AAAAGCAAGG CTAACTAAGA AATTTCTACG CAAGAAAATA AAAAGAAAAA TCAAAACAAG 6486 0 

CGCAATAACG AGAGCAACTT GCCAAGGCAA TAGCAACGGT TGCGGTAGAA ATAACAACGT 64 920 
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64980 
65040 
65100 
65160 
65220 
652B0 
65340 



TAAATCCGCG ACAATTAACA AGACAACTAA AGTTATTAAG TTTAATTTCA TTTTTATAAA 
AGATTAATTC CAGCCTCTCT AAGCCTAGGC TAGTTTTATT TAAATATGCA AACGGTTGCT 
TCAAATTTTG CGATCCATAT CGTAAAAATA AATTTTTCAG GAAAATTTAC TTGCTAAAAC 
AAACAACTTC TGATATTTAT ACCGCCCGCA AAAATTGGGT TCCTAAAATA GCAGTCAGGA 
ACAAACTAAA ATAACAAGAC AGGAAGTCGT GAAATCCACG CTTAATTCTG TATGTATTAG 
GAGTGAGAGA TGTCAAGAGC ATCATTGAGC TTGCTAGATT TAGCAGGCGT TAAGCCTTAT 
CAAATGAAAA AAGATGAAGA ATATATGAnT GAnGAGCAAA TTCTTCATTT CAGAAAAATT 

CTTAATGCTT GGCACGAGCA AATCGTGGAA GAAGCTTCTC GCACCGTTGC ACATATGCAG 65400 

GATGAAGTAA CCAATTTCCC CGATCCTGCA GACCGAGCAA CTCAAGAAGA GGAATTTAGT 6 5460 

CTAGAATTAC GTAATCGAGA TCGCGAACGT AAACTAATGA AAAAAATCGA AGCAACCTTG 6 5520 

AAGAAACTTG ATACCGATGA TTTTGGTTAC TGCGATTGTT GTGGTGAAGA AATCGGCATT 65580 

CGCCGTTTAG AAGCGCGCCC AACAGCAGAT CTTTGCATTG ATTG CAAAAC TTTAGCTGAA 6 5640 

ATCCGCGAAA AACAAGTCGC TGGTTAATTC ATTCTATTGA AAAATACAAA AGTG CGGTTA 6 5700 

AAATTAACCG CACTTTTTTG CATTTAAAAA TGCAGGAAAA TCCACTTGCC TTTTTCTTTA 65760 

CAAGAAAAGA GCCAAGTCAT TTATTTCCAA CAACAAAACG GAGTAAATTA TTCTTACTTA 6 5820 

TATCAAAAAT TTGTTTGGCA AGAAGACAAA AAAATCAAAA GAGGTTGCCA AAACAAGTGC 6 5880 

TCATTCTAAA CGAGTTGAGC GTTCTAATAG CCAAATGCCA AAAGCGCGAG CAAAAAAATC 6 5940 

AGAACAAACA CGTCGCTATG ATAAAAATGT GATTAAAGCG GCTCAATTTG ATATTTCTCC 66 000 

GCGTGATTTT AGCCGTAATG CTCTCAATGT AGTGGAAAAA CTGCAACGTC AAGGTTTTGA 6 6060 

GGCTTATATC GTCGGTGGAT GTATCCGCGA TTTATTGCTC GGTAAAAAAC CAAAAGATTT 66120 

CGATGTTGCA ACAAACGCCC GCCCTGAGCA AATCCAAAAC ATTTTCCAAC GCCAATGCCG 6618 0 

TTTAGTGGGT CGCCGTTTCC GCTTAGCACA TATTATGTTT GGTCGAGATA TTATTGAAGT 66240 

CGCCACTTTC CGTGCTAATC ACAGTGATGC TCGCAATGAA AATCAAGCGA AACAGAGCAA 66300 

TGAAGGCATG TTACTGCGCG ATAATGTTTA CGGTACGATC GAACAAGATG CAGCACGCCG 6 6360 

TGATTTTACG GTCAATGCAC TGTATTACAA TCCGCAAGAT AATACGTTAC GAGATTATTT 6642 0 

CGAGGGCATT AAAGATCTCA AAGCAGGCAA GCTACGCTTA ATTGGCGAtC CTGTTACACG 66480 

TTATCAAGAA GATCCTGTGC GCATGCTACG CTCAATTCGT TTTATGGCAA AACTTGATAT 66 540 

GTTCCTTGAA AAACCGAGCG AACAAC CAAT TCGTGrACTT GCGCCTTTAT TAAAAAATAT 6660 0 
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TCCGCCAGCT 


CGCTTATTTG 


ACGAAAGCTT 


1 AAAATTATTG 


CAGGCGGGAC 


AAGGGG TAAA 


66660 


AACTATCGGT 


TATTACGCCA 


, ATATGGTTTA 


TTTGAACAAC 


TTTTCCCTGC 


ATTAAGCGCA 


66720 


TATTTTACAG 


AGAAAGAAGA 


TAGCTTTGCG 


GAACGTATGA 


TCGTCACTGC 


ACTTACTTCT 


66780 


ACGGATGAAC 


GTGTTGCGGA 


TAAACTTCGT 


ATTAATCCTG 


CATTCTTATT 


TGCAGCATTT 


66840 


TTCTGGTATC 


CATTGCGCGA 


AAAAGTGGAA 


ATTTTAAAAA 


ATGAAGGTGG 


TTTAAATAAT 


66900 


CATGATGCTT 


ATGCTTTAGC 


AGGCAATGAA 


GTTTTAGACT 


TATTCTGCCG 


AG CTTTAGCT 


66960 


GCACCACGTC 


GTCATACTGC 


GGTCATTCGT 


GACATTTGGt 


TCTTACAACT 


GCAATTATTA 


67020 


AAACGTACTG 


GATCAGCACC 


AATGCGTACC 


ATGGAACATC 


CAAAATTCCG 


TGCAGGCTTT 


67080 


GATTTATTGG 


CTATGCGAGC 


AGAAATTGAA 


GGTGG CGAAA 


CCATTGAATT 


AGCCAAATGG 


67140 


TGG CACGAAT 


ATCAATTCAG 


CAATGGTGAA 


CAGCGTGAAC 


AGTTGATACA 


AGAGCAACAA 


67200 


CGCTTACATC 


CTAAACCAAA 


GAAAAAATAC 


TACCGTCCAC 


GCCGTCGTAA 


AACAACGTGC 


67260 


TCAGCAGAAT 


AAGGGAAAAG 


ATGATTACGG 


CATATATTGC 


TCTAGGTAGC 


AATTTAAACA 


67320 


CACCAGTAGA 


ACAATTACAT 


GCTGCGCTTA 


AAGCGATAAG 


TCAGTTATCA 


AACACTCATT 


67380 


TGGTAACAAC 


GAGCTCTTTT 


TATAAAAGCA 


AACCGCTCGG 


CCCACAAGAT 


CAACCCGATT 


67440 


ATGTGAATGC 


CGTTGCAAAA 


ATTGAAACCG 


AACTTAGTCC 


GCTCAAGCTA 


TTAGATGAAT 


67500 


TACAGCGTAT 


TGAAAACGAG 


CAAGGTCGAG 


TGCGTTTACG 


TCGCTGGGGC 


GAACGCACGT 


67560 


TAGATTTAGA 


TATTTTGCTC 


TACGGCAATG 


AAATTATTCA 


AAACGAACGC 


TTAACAATTC 


67620 


CCCATTATGA 


TATGCATAAC 


CGTGAGTTTG 


TCATTGTGCC 


GTTGTTTGAA 


ATCGCATCGG 


67680 


ATTTAGTTTT 


ACCAAATAGT 


CAAATCATAA 


CAGAGCTGGT 


CAAGCAATTT 


GCTGACCACA 


67740 


AAATGATAAA 


ATTAAATCCC 


T AG AAATTT C 


ACCGCACTTT 


TATGAACCTC 


AAAAGTGCGG 


67800 


TAGAATCAGC 


CCACATTTTT 


CTTCATTCAT 


GACCCGTTAT 


GGAAAGTTTG 


ACTCAATATA 


67860 


TCCCTGATGA 


ATTTTCTATG 


CTCCGCTTCG 


GCAAAAAATT 


TGCCGAAATT 


CTTTTAAAAT 


67920 


TG CAT AC AG A 


AAAAGCAATT 


ATGGTTTATC 


TTAACGGTGA 


TCTTGGGGCA 


GGAAAAACAA 


67980 


CGCTAACGCG 


CGGAATGTTG 


CAAGGTATCG 


GTCATCAAGG 


TAATGTAAAA 


AGCCCAACTT 


68040 


ATACGCTGGT 


TGAAGAATAC 


AATATTGCAG 


G CAAAATG AT 


TTATCATTTT 


GATTTATATC 


6 8100 


GTTTAGCAGA 


TCCTGAAGAG 


CTCGAATTTA 


TGGGCATTAG 


AGATTATTTT 


AATACGGATA 


68160 


GCATCTGCTT 


AATTGAATGG 


TCTGAAAAAG 


GTCAAGGCAT 


ATTGCCAGAA 


GCGGATATTT 


68220 


TAGTGAATAT 


TGATTATTAC 


GATGATGCAC 


GAAATATTGA 


ATTAATCGCA 


CAAACAAATT 


68280 
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TGGGTAAAAA TATTATAAGC GCATTTTCTA ATTAACTTAT TTATGAAAAC TAAAATTTTA 6 8 340 

TTTTTCCTTT TTTTCTCAAC GTTTAGTTTC TCTATTTTTG CAGCACCTAT TACGATTGCG 68400 

ATTGACCCAG GTCATGGGGG AAAAGATCCT GGTGCAATCA GCCGAAACTT AGGCATTTAT 68460 

GAAAAAAATG TTACCCTTTC AATTG CAAAA GAATTAAAAG CCTTATTAGA TAAAGATCCA 6 8520 

CATTTTCGCG GTGTATTAAC GCGTAAAAGT GATTATTATA TTTCCGTGCC AGAACGTTCA 6 8580 

GAAATCG CAC GTAAATTTAA AGCGAATTAT CTTATTTCCA TTCATGCCGA TTCATCTAAA 6 8640 

TCACCTGATC GTCGCGGGGC TTCTGTTTGG GTATTGTCAA ACCGTCGAGC CAATGATGAA 6 8 700 

ATGGGTCAAT GGTTAGAAGA CGATGAAAAA CGCTCGGAAT TACTTGGTGG AGCGGGAAAG 6 8 760 

GTGCTTTCAC ACAATAATGA CAAATATCTT GACCAGACTG TACTTGATCT GCAATTCGGA 6 8820 

CATAGCCAAC GTACAGGCTA TGTGTTAGGC GAACATATTT TGCACCATTT TG CTAAAGT A 688 80 

ACAACATTAA GTCGCAGTAC GCCACAACAT GCAAGTCTGG GTGTTCTACG TTCGCCAGAT 6 8 940 

ATTCCTTCTG TGTTGGTAGA AACTGGATTT CTTTCCAATT CAGAAGAAGA AAAAAAACTT 6 9000 

AATTCCCAAA CTTATCGTCG CCGCATTGCT TATATGATTT ATGAAGGCTT AGTTGCATTC 6 9060 

CATAGCGGTA AAACCAACAC TTTAGTTAAA GACAATTTAG TGCAAAACAT CAAACAAAAC 6 912 0 

GATATAAAAA AATCAGGGAA AAATAACCGC ACTTCAGAAC AAAATATAAA TGAAGATAAT 6 9180 

ATTAAAGACA GTGGCATTCG CCATATTGTA AAAAAAGGAG AAAGTTTAGG TAGCTTATCA 6 9240 

AATAAATATC ACGTAAAAGT CTCTGATATT ATTAAATTAA ATCAATTAAA GAGAAAAACA 69300 

CTTTGGCTAA ATGAAAGTAT TAAAATCCCT GATAATGTCG AGATAAAAAA TAAATCATTA 6 9360 

ACAATTAAAG AAAATGATTT TCATAAAAAA CAAAATAGCT TAGTAAATAA TACCAATAAA 6 9420 

GATTTAAAAA AAGAGAAAAA TACACAAACT AATAATCAAA AAAATATAAT TCCGCTCTAT 6 9480 

CATAAAGTAA CAAAAAACCA AACACTTTAT GCTATTTCGA GGGAATATAA CATTCCAGTA 6 2540 

AATATACTAT TGAGCCTAAA TCCACATTTA AAAAATGGCA AAGTTATTAC AGGACAAAAA 6 9600 

ATAAAATTAA GAGAAAAATA AATGCCTATT AAAATTCTTT CCCCACAACT TGCTAACCAA 6 9660 

ATTGCTGCTG GAGAAGTGGT AGAACGCCCT GCATCAGTGG TAAAAGAATT AGTGGAAAAT 6 9720 

AGCCTTGATG CAGGTGCAAA TAAAATCCAA ATTGATATCG AAAATGGTGG GGCAAATTTA 6 9780 

ATCCGCATTC GTGACAATGG CTGTGGTATT CCAAAAGAAG AATTAAGCCT CGCTTTGGCA 6 984 0 

CGACATGCCA CGAGTAAAAT AGCCGATCTT GATGATCTCG AAGCGATTTT AAGTTTAGGT 6 9900 

TTTCGTGGCG AGGCATTAGC TAGTATCAGT TCCGTATCAC GCCTAACCCT TACTTCTCGT 6 996 0 
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ACCGAAGAAC AAACTGAAGC GTGGCAAGTT TATGCGCAAG GTCGTGATAT GGAAACCACA 7 0 020 

ATCAAACCCG CATCACATCC TGTCGG CAC A ACAGTTGAAG TCGCCAATCT TTTTTTCAAC 7 0080 

ACACCAGCCC GCCGAAAATT TTTGCGCACA GATAAAACAG AATTTTCC C A TATTGATGAA 7 014 0 

GTCATTCGTC GTATTGCTTT AACGAAATTC AACACAGCAT TTACATTAAC G C AT AATGG A 70200 

AAAATTATCC GTCAATACCG TCCTGCCGAA GCAATCAATC AACAACTAAA ACGTGTCGCC 7 0260 

GCTATTTGTG GGGATGATTT TGTCAAAAAT GCCTTACGAA TTGAGTGGAA ACACGATGAT 7 0320 

TTACATTTAT CAGGTTGGGT GGCGACACCA AATTTTAGCC GAACTCAAAA TGATTTAAGT 7 0380 

TATTGCTATA TTAATGGCCG AATGGTTCGA GATAAGGTAA TCAGCCATGC TATTCGACAA 70440 

GCCTATGCAC AATATTTACC AACCGATGCT TATCCTGCGT TTGTGTTGTT TATTGATTTA 70500 

AACCCGCACG ATGTAGATGT CAATGTTCAT CCAACCAAAC ATGAAGTACG CTTTCATCAA 7 0 560 

CAACGCCTCA TTCATGATTT TATTTACGAA GGCATAAGTC ATGCGCTAAA CAATCAAGAA 70620 

CAGATTAATT GG CAT AC AG A CCAAAGTGCG GTAGAAAATC ACGAAGAAAA TACGGTGCGT 7 0680 

GAGCCCCAAC CGAATTACAG CATTCGTCCT AATCGTGCAA CTGCGGGGCA AAATAG CTTT 7 0740 

GCGCCACAAT ATCATGAAAA ACCACAGCAA AATCAACCGC ACTTTTCTAA CACACCAATG 70800 

TTTCCAAATC ATGTAAGTAC TGGTTATCGA GATTACCGTT CTGATGCACC GAGTAAAACT 7 0860 

GAGCAACGTT TATATGCTGA ATTGTTACGC ACGTTACCGC CTACAGCGCA AAAGGATATA 7 0920 

TCGGATACAG CACAACAAAA TATTTCAGAT ACAGCTAAAA TCATCTCAAC TGAAATAATT 7 098 0 

GAGTGTTCAT CACATCTACG AGCCCTTTCA CTTATTGAAA ATCGTGCGTT GTTATTACAA 71040 

CAGAACCAAG ATTTCTTTTT GCTTTCATTG GAAAAATTAC AACGCTTGCA ATGGCAACTA 7110 0 

GCGTTAAAAC AAATACAGAT TGAACAACAG GCTTTATTGA TTCCAATCGT GTTCCGATTA 71160 

ACCGAATCAC AATTTCAAGC ATGGCAACAA TATTCAGATG ACTTCAAAAA AATTGGCTTT 7122 0 

GAATTTATCG AAAATCAAGC ACAATTACGC TTAACCTTAA ATAAAGTGCC AAGTG C ATT A 71280 

CGCACACAAA ATCTACAAAA ATGCGTGATG GCAATGCTGA CTAGAGATGA AAATTCATCA 7134 0 

TCATTTTTAA CCGCACTTTG CGCACAATTA GAATGCAAAA CCTTTAACGC TTTAG CGGAT 714 00 

GCACTCAATT TATTGAGCGA GACGGAACGT TTACTCACGC AAACAAATCG CAC AG CATTC 714 6 0 

AC C C AATT AC TCAAACCTGT TAATTGGCAG CCTTTATTAG ACGAAATTTA AATT C TT CAT 71520 

GAAACCAACC GCAATTTTTT TAATGGGCCC AACAGCCTCA GGTAAAACAG ATTT AG C CAT 715 80 

TCAACTACGA AGCCAACTTC CAGTCGAAGT GATTAGTGTG GATTCAGCTC TT ATT T AT AA 7164 0 
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GGGAATGGAT ATTGGCACAG CGAAACCATC AAAAGAAGAA CTTGCCCTTG CGCCTCACCG 
TTTAATTGAT ATTTTAGATC CATCAGAAAG CTATTCCGCA ATGAATTTCC GCGATGATGC 
GCTACGCGAA ATGGCGGATA TTACCGCACA AGGCAAAATT CCGTTACTAG TGGGCGGCAC 
GATGTTGTAT TACAAAGCCT TAATCGAAGG GCTTTCGCCC TTGCCTTCTG CTGATGAAAA 
TATTCGCGCC GAGCTTGAAC AAAAAGCAGC TCAACAAGGT TGGGCAGCAT TGCATACAGA 
ACTCGCCAAA ATCGATCCCA TTTCTGCCGC C CGAATTAAT CCCAGTGATT CTCAACGAAT 
CAATCGCGCT TTAGAAGTTT TTTACATCAC AGGAAAATCA CTCACAGAGC TGACTGAAGA 
AAAAGGCGAA GCCTTGCCTT ATGATTTTGT GCAATTTGCA ATTGCGCCGC AAGATCGCCA 
TGTTTTG CAT GAACGCATCG AACAACGCTT TCATAAAATG ATTGAACTAG GCTTTCAAGC 
AGAAGTGGAA AAACTTTATG CGCGTGGCGA TTTAAATATT AATCTGCCGT CAATTCGCTG 
TGTAGGCTAT CGTCAAATGT GGGAATATTT GCAAGGTGAT TACGCTTACG AGGAAATGAT 
TTTCCGTGGT ATTTGCGCTA CGCGCCAACT TGCAAAACGC CAGCTTACTT GGCTGCGTGG 
TTGGAAAACA CCAATACAAT GG CT AG AT AG TTTACAACCT CAACAAGCGA AAGAAACTGT 
ATTACGTCAC TTAGATTCTT ATCAAAAAGG TTAGAAAAAA TGACCGCACT TTCTGCCCTC 
ATCGAACAAA AATTACAATC TCTTGGCACA ATCCTGTGCC AATCTTTTCC AGAACTTTCC 
CCAGAAAAAA TTCACGAAGC CATTCAACAA AATTCAAATC ATCCAGAACT GCCTGTTTAT 
CAATTAGGCT ATGCCATAGC AATGTCTGAC TTTGTGGAAA AAGTCTTGCA AAAATACCCG 
CACTTAGTGA AACAATGGTG GGAAAAATCC CCCTGTTTTG CAGATTGCAC TGCCTATATT 
GAACGTTTGA ATAAAGAGTT AGCCAATGTA CATGACGAAA CTGCTCTCTA TCAAACATTA 
CGCCAATTCC GTAACATTGA AATGGCAAAA TTAAGTTTTT GTCAAAGTTT GAATCTTGCG 
ACAGTTGAGG AAATTTTTGT TCAGCTTTCA CAACTTGCAG AGAGTTTAAT TATCGCTGCC 
CGCGATTGGC TTTATCATCA AGCCTGTGCG G AG AT AGG C A CACCAATGGA TGAACAAGGC 
AATCCGCAGC AGCTTTATAT TCTAGGAATG GGGAAATTAG GCGGCTTTGA ACTAAATTTT 
TCCTCAGATA TTGATTTAAT TTTCACTTAT CCATCACAAG GGGAAACGTC CGTTGAGAAT 
ACGAATGCTC GACGTTCTGT GGATAATGGA AAATTTTTTA CACGTTTAGG GCAACGCCTT 
ATTTCTGCCT TAGATGAATT TACCTCTGAT GGTTTTGTAT ATCGAACCGA TATGCGCTTG 
CGCCCATTTG GCGATAGTGG CGTGCTTGTT CTCAGCTTTT CAGCAATGGA ACAATATTAT 
CAAGATCAAG GTCGAGATTG GGAACGCTAT GCAATGATTA AAGGACGAGT ATTGGGCGCA 
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CAAACAAAGG ATCCCAACGT TTCAAAATTG AAAAAAATGT TACGCCCTTT TGTCTATCGC 7 3380 

CGTTACATTG ACTTTAGTGT GATTCAATCT TTACGTGAAA TGAAAGGCAA AATTGAGCGA 7344 0 

GAAGTACGCC GTCGTGGTTT GAAAGATAAT ATTAAACTAG GTGCTGGTGG TATTCGTGAA 73 500 

GTCGAATTTA TCGTACAAGT TTTTCAACTC ATTCGAGGCG GGCGAGAAAT CAATCTGCAA 73 560 

CAACATTCAC TCTTAAAGAT CTTGCCTGAA ATTACCAAAC TAAGATTAAT TACCGAACAA 73 62 0 

CAATTCCACG ATTTACGTGA AAGTTATATT TTTTTACGCC GCGTAGAAAA CATTCTACAA 73 6 80 

GCCATTAACG ATCAACAAAC G C AAAC ACT A CCAACCGATG AAATAGATCA AATTCGTTTA 73 740 

GTCG AGG CAT GTAAAACTTT CACCTGTTTA AATGAAAATA ATCAAACCAT TGAAAAACAT 73 8 00 

TATCCAATAG AAAATTGGGA CGATTTTTAC CGCACTTTAC AACAAAAACA AGAAAAAGTG 7 3 860 

CGGGCAGTAT TTACTCAATT AATCGGTGAT GAACAAGATG AGTCTTCACA AACTGATAGC 7 3 920 

CAATGGCAAG ATTTTTTAGA AGCAGATTTT GAGGATATTG AACAAACATT ATT AG AG AG T 7 3 980 

AGTGTATCGG AAAATAATAT TGATGAAGTT TTAGAAAAGC TTGCTCAATT CAAAGACGGC 7 4 040 

CTAAATCACA GAGTGATTGG TTCGCGTGGG CGTGATGTTT TGAGTCATTT GATGCCAACA 74100 

GTACTTGCAC TCATTTTTGA ACAAGAAAAT TACCGCACTT TGCTTCCGCG GATTCTGAAC 7 416 0 

ATTATTGAAA AAATTTCGAG TCGCACGACT TATTTAGAAT TATTATTGGA AAATCCTCGC 7 4 220 

GCACTTCAGC AAGTTATTGA ACTTTGTGCG CAGTCTCAAC TTATTTCAGA ACAATTGGCG 7 4280 

CACTATCCTA TTTTATTAGA TGAATTGCTT AATACG GAAG CACTACGTCA TCCTTTGCCA 74 34 0 

TTTACCCAAT ATCCTGCAGA GTTACACCAA TATTTGCTTC GTTTACCCCC TGATGATGAA 74 4 00 

GAACAGCTTA TCGATGCTTT ACGTCAATTT AAACAAGCAA CCCTATTAAA AGTGGCAGCG 744 6 0 

G C AG AT ATTT TAGGTGCATT GCCAGTAATG AAAGTAAGCG ATCATTTAAC CTATCTTGCT 74 52 0 

GAAGCCATTA TTGAAGTGGT TGTGAATCTA G CTTGG AAAC AAGTTTCCTC TAGATTTGGT 74 58 0 

GTACCTGAAC ATTTACAAAA TAACGAAAAA GGATTTTTAG TTATTGGTTA TGGAAAACTA 746 4 0 

GGTGGTATTG AACTCGGTTA TAAATCCGAT TTAGACCTCG TATTTTTGTA CGATGCCGTA 74 70 0 

GAAAGTCAAA CAACAGGTGG CAAAAAAGTG ATTGATAGCA ATCAATTTTA CCTACGCTTG 74 76 0 

G C AC AAAAG A TTGTGAGTAT TTTC AG CAT A AATACTTCTG CTGGCGTACT TTATGAGGCA 7482 0 

GATATGCGCC TGCGCCCTTC TGGCGATGCT GGCTTAATTG GCTGCTCTCT TTCTGCATTT 74 88 0 

TCTCATTATC AATTGAATGA CGCTTGGACA TGGGAAAAAC AAGCACTTGT GCGCGCTCGT 74 94 0 

CCAGTATTTG GCGAATCATC TCTTAAAGCT AAATTTGAGC ATATCCGCCA ACAAGTGCTT 7 5 000 
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TCTGCATCAC GAGATATTGA ACAATTAAAA CAAGATGTTG TTGAGATGCG GAAAAAAATG 
TTTGCCCATT TATCTCACTC AAAACACAGT GAATTTAACC TTAAAACAGA TCGCGGTGGC 
ATTACAGATA TAGAATTTAT TGCTCAATAT TTAATGCTTG CCAATGCCCC GAAAAAACCA 
CAACTAACAA AATGG TCAG A CAATGTACGT ATTTTCGATG ATATGGCAGA GGCTAAAATT 
ATTAGC CAAA CAGACTG CGA CACATTAAAA CAATGTTATG TCGATTTACG CAATACCATT 
CATCATTTAA ATTTGTTGGG TAAATCTTCC GTTGTGGAAG ACAGTGAATT TGTTAAAGAG 
CGGTCATTTA TTCAAGCATT TTGGGATAGG TTATTTGTAT CCTAAAATAT GGCTTGTGGA 
CGTTTCAAAA ATAGCCTTCA TCACAAATCT TCTTGAAAC C TCCCAAAAAC TAAATATTCA 
ATCCTATTTT TAATGAACCA AATCTAACAT TTCCTGTGCA TTGG CTAGAG CTAATTCTGT 
AATTTCACTT CCGCCCAAGA GTCTTGCCAA CGCAGGTACA CGTTCAGCTT GTGAAAGTGC 
GGTCATTTTT GTTTCAGTTT TATTATCAAC AGTAAATTTT TCCACATTAA ACTGATGATG 
GCCATGACAA GCAACCTGTG GCAGGTGGGT TACGCATAAT ACTTGACAAC GCTCACTTAA 
TTGACGTAAA AGTTTTC CGA CAACACTTGC TGTTTTTCCG CTAATGCCCA CGTCCACTTC 
ATCGAAAATT AAGGTTGGAA TAGCCGATTG ATCAGAAGTT AAAACTTGAA TGG CTAAAGA 
AATACGAGAA AGCTCACCAC CAGAGGCAAT TTTAGCTAAT GGCTGCGGTT GTTGTCCTAA 
ATTACTACGC AAAGTGAAAA TCACATTATC CGCACCATTT GCACTCACTT TATCTAAATC 
CGTATTCAAC TCTACATAAA ATTCAGCGTT TTCCATTGCA AGTTGTTTTA TCGACTGTGT 
GACATTTTGC GCTAAGCGGG TTGCAGATTG CTGACGACTT GCTGTTAATG CTGTAGCCGT 
TGCTTGCATT TGTTCAAATG CTCGTTTTTC TTGCTCAATC AACATCTCTT CACTTTCAGA 
AAAATCTAAA AGTGCGGTTA ATTCTGCTTT TAATTTTTTG TGCTGTTCTA CAAGATCTTC 
AGGTTTAACT TGATGTTTAC GAGCAAGTTG TAAAGTCTGA CTAATTCTTT GTTCGATTTC 
CTGTAAAAGC TGAGGATCTT GCTCGATATT ACTTGATAAA TTTTG CACTT CACTGGTTGC 
TTCTTGCACT TGAATCAACG CATCATTCAA CATATTTTG C ACTTCAGCAT ATTGCGGATC 
AAGTTC AG CT AATTCATCAA TATATTGTGT CGCACGATAA AGCAAACTAT CGATATTTAC 
GGTGTCATTT TCACTTAAAA TTTGTAGGGC TGATTGGGAA AGTTGAGTTA GCTGTTCGCT 
G CTAG AT AAA CGACGTTGTT CATCTTCTAA TTCCAAATAT TCATTTGGTC G AAGG TT AAA 
TTCATCTAAC T CATC CACTT GATACTGTAA AAGTTGTTTA CGTGCTTCAT TTTCCGCCAC 
TTTTTGTTGA AAGGTTTTAA CTTGATTTTG AAGATTTTTC CATGTCTGAT AATCTTCACG 
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CATTTTCACA 


AGTAATTCAG 


GATGAGCGGC 


AAAATTATCA 


ACTAACTGAA 


GCTGATAGTC 


76740 


ATTTTTTAAC 


AATAATTGAG 


AAGCGTGTTG 


CCCATTAATA 


TGAACGAGAT 


ATTGAGCAAT 


76800 


TTCTTTTAAT 


TGAGAAGCGG 


ATACTGGTGT 


ACTATTAATA 


AAAGC CTTAG 


AACGTCCGTC 


76860 


TGCATTAATT 


ATGCGACGTA 


AAATACATTC 


AGTTGGATTA 


TCTGGATCCT 


GTAATTCTTG 


76920 


CTCACTCAAC 


CAGTGATAAG 


CAGGATTTTG 


CGATTCTAGT 


TGGAAAGTTG 


CACAAATTTC 


76980 


CGCACGCTCT 


TGCCCTTCAC 


GCACCATTGA 


GGTTTCTACT 


CTTTGTCCGA 


AACATAAACC 


77040 


TAAAGCATCA 


ATGGCGATAG 


ATTTTCCAGC 


ACCAGTTTCC 


CCTGTTATAA 


CAGACATTCC 


77100 


TTTCGCCAAT 


TCAATATCAA 


GCTGACGGAC 


AATCGCAAAA 


TTATTGATAG 


TAAGTTGAGT 


77160 


AAGCATAGCA 


AAATCCTTTA 


ACTGTATATT 


TAATATGATA 


TAACTGACTT 


AATTTACAGT 


77220 


AAAGGATTTT 


TTACAGTATT 


TTGTCAAATC 


TATT TC AATC 


GAAAATTTTT 


ACCATAATCA 


77280 


AAATGATTTT 


AACCAACCTA 


ATTTGCTACT 


TAACACATTG 


TAATAATTAT 


AATTTTCAGA 


77340 


TGAAGCAATC 


GGAGTTTATG 


CTCGCTTTTT 


TGAATATGCA 


CCACATCATC 


TGGGGTAAAA 


77400 


GGTAAGGTAA 


TTTGACTATC 


ACAGCCCACT 


TCTAATTGAG 


AGGTATTATG 


TTCAGCAAAA 


77460 


CGAATCGATA 


TTTTACTATC 


CCCATCAACA 


ACAAGAGGGC 


GAGAAGTTAA 


TGTATGTGGA 


77520 


AACATTGGCA 


CTAATGCAAT 


GGCATTAAGG 


TTTGGTGTCA 


AAATAGGTCC 


ACCAGCGGAA 


77580 


AGAGAATAAG 


CCGTAGAACC 


TGTTGGAGTA 


GAAACAATTA 


ATCCATCAGA 


ACGTTGAGAA 


77640 


AATGCAAACT 


TATCATTGAT 


ATATACGTGA 


AAATCAATCA 


TATGTGCAAT 


TTTGGCGGGA 


77700 


TGAATAACCG 


CTTCATTTAC 


CGCATTGCTG 


GTTGATACGA 


TTTCACTTGC 


TCGTTCGATT 


77760 


TTTGCTTCCA 


ATAAAAAACG 


TTCTTCCACA 


AAAAATTCGC 


CACGTTCTAA 


ACAAGCTTCA 


77820 


AGCTGGGAAT 


AGGCATTTTT 


CGGGTCAATA 


TCCGTTAAAA 


ATCCCAAATT 


ACCACGATTA 


77880 


AT AC C AATC A 


ATGGAATATC 


ATATTTTGCT 


AATACGCGAG 


CGCGCCCTAG 


CATATTGCCG 


77940 


TCTCCACCAA 


TCACAATGGC 


TAATTGGGCT 


CGGTGGCCTA 


TTTCTTCTAA 


CGT AG CAAGA 


78000 


TGTTCAAAAG 


GTAACTCAAG 


TGTTATGGCG 


ACTTCTTTTT 


CCACCAACAC 


TTGATAACCA 


73060 


CGTTCCATTA 


ACCAATGAAA 


TAAGTTTTTG 


TGCATTTGTA 


AGTTAATATC 


ATTGCGAGGC 


78120 


TTACCTACAA 


GAG C AATGG T 


TTTAAAGGAA 


CGATATAAAT 


GATTCATAAA 




7qi on 


TTGAATTTAA 


CATTTTCGTC 


ATTATATCGT 


TTTTTGCTAC 


ACTATGCCCA 


ATTACGATTA 


78240 


ATCAATTTTG 


TTTTACGGAG 


AAAAAAATGT 


CAGAACAAGA 


ACAAAAAGTT 


GAAATCCCAG 


78300 


AAGTAGAAAA 


GCAAGAAGAA 


GTCGTTGTGG 


AAGAAACACA 


ACAAGCAGAG 


CATTCTCAAG 


78360 
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AATTTGATCC TTTAGAAGAA GCAATTGCTC GCGTGCAAGA GCTTGAAGAG CAGCTAAAAA 7 8420 

CTCAAATAGA AGAAGCGGCA AACAAAGAAC AGGATATTTT ACTTCGCTCT CGTGCTGAAA 7 8480 

TTGAAAATCT TCGTCGTCGT ACTGAACAAG ACGTAGAAAA AGCACATAAA TTCGCCTTGG 7 8 540 

AAAAATTTTC AAAAGACATT TTGAATACTA TCGATAACCT AGAGCGTGCC CTTGCCACTC 78600 

CAGCAAATAA AGAAGATGAA AGCGTAAAAG CCTTATTTGA CGGTGTAGAA CTAACCTTAA 7 8660 

AAGAATTAGT TTCAACCGTT GGGCGTTTTG GCGTGGAAGC TGTGGGAGTT GTCGGTGAAG 7 8 720 

CATTTAATCC TGATTTACAT CAAGCCATTT CAATGCAACC AGCTGAAGGC TTTGAAACCA 78780 

ATCAAATTAG CGTCGTATTG CAAAAAGGCT ACACCTTAAA TGGTCGAGTC ATTCGCCCAG 7 8 840 

CGATGGTAAT GGTGGCAGCT TAATTTACAT TTTGTCTAAA GAGCGGTCGG TTTTTCGACC 7 8 900 

GTTTTTTCTT TTTCCTGAAT AACTACATAA ACTTTACGGA TAATTTCCCT AAAATCTTCG 7 8 960 

CTCGTTGTAG CCCAATCTAG AAGATCAACG CGCCACGGTA AATCTGATTC AG AAAAAG CT 7 9020 

TCCTTTAATC TGTCACGAGC TAAAAAATCT AAAGGTTCCT CAGAGATAAT CGCAAGGTCA 7 9080 

AGATCGGAAT ATTTTTTTGC TTTTCCCTTT ACACGAGAAC CGAAAGCCCA CACGGTATAA 7 914 0 

TCAGGTACTA ATTGTTGTAA AATAGTTTTC ACAATCGCCA ATTCTTCAGA TTTTATATCA 7 9200 

AGCTGTGCAA AACTAGTCAT ATTGATTTCT CTGACGTAAT TGTTCCAATA GGAAACTACT 7 9260 

TTCTATTAAA AAATCATCAA TTTGTGCATA AACAGCCATG GCTTTTTCCT GATCATAAGT 7 9320 

ATG TGATGT A ATATTACGCA TATCACGATA AG C AAC C CAT TTTGACATAT CTCCAATCAA 7 9380 

ACCAAATCTC AATGCTTCTC GCAAAATATC CTTAAATCCA TAAGCCCCAA TGTCATCGGT 7 9440 

ATTAATTG C A TCTTGTTGAA GTTGGCGTTT CATCATTTTT AAACTTAACT CATAAACAAA 7 9500 

TTCAAATTTC TGAATTGCAC CTGCAATTAA GGTATCTTGT ACAATAGAGG GTTGCATATC 7 956 0 

AAACCAATTT CTGTCTGAAA TTTGTACTAC GGTTTGCTCT AACGAATAAA ATGCAGCATC 7 9620 

TAATACATTT AAATTTAG CT TATCAGTCAT CATTTTTACC TCTGTTAAAT ACTCATCTAT 7 9680 

TTT AC CAT AG ACTATTTTTT CTCAAAACCT AT AAC C C C AA AAGTAAACAT TGTGATATTG 7 9740 

ATCACAAAAT CAAAAATATT TGAGAAAAGC TCTCAACAAC TTAATTTGAT ATTTTTCTAA 7 9800 

TAAAATCAAT CTTTAACAAA ATTATCTTAC GACAAAAAGG TCTAAAACAC TACATATTGT 7 9860 

GTGTTGATTA TCTTTTTAGA TCTATATATA GTGCAATCCG CATTTCAAAA ACACTATATA 7 9S2 0 

TTGGAGCTAT ATCATGAGTA ACTTTGGGGT CATCAAGCGC GATGGTTCTC GTGCGGAGTT 7 9 980 

TGAGATTCAA CGTATCATTA ATGCCATTAA AAAAGCCGCA AGTnCGGTCA ATATTTCCGA 80040 
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TGAATTTTAT TGCCACCAAA TTGGGCAAGA AGTAGGGAAT GAAATTTTCA CCCGTCATCA 8 0100 

AGGCGAGATT GATATTAATC AGATTCAAAA AATCGTTGAA GATAAATTGA TGGCAAGTCG 8 016 0 

TTATCCTGAG GTTGCTCGTG CGTATATCGA ATATCGTCAT GATCGTGATT TAGCACGAGA 8 022 0 

AAAACGTAGC CAGCTAACAA AAGAAATTGA AGGTTTGATT GAACAAAGCA ACGTAGAATT 8 0280 

ACTTAATGAA AACGCCAATA AAGACGCGAA AGTGATTCCA ACGCAACGAG ATTTGCTTGC 80340 

TGGAATTGTG GCAAAACATT ATGCCAAGCA CAATATTCTT CCACGCGATG TGGTAGAGGC 8 0400 

TCACGAAAAA GGTGAAATTC ATTATCACGA TTTAGATTAC GCGCCGTTCT TTCCAATGTT 8 0460 

TAACTGTATG TTAGTTGATT TAGAAGGGAT GCTTTCTCGT GGTTTCAAAA TGGGCAATGC 8 0 520 

GGAAATCGAA CCCCCTAAAT CTATCGGCAC CGCAACGGCA GTGACAGCAC AAATTATCGC 8 0 580 

CCAAGTTGCC AGCCATATTT ACGGTGGCAC AACCATTAAT CGTATTGATG AAGTGCTTTC 80640 

CCCTTATGTG CAAATCAGTT ACGAAAAACA CCTCAAACAT GCCCAAGAAT GGAACGTGCC 8 0700 

TGATGTTGAA GGTTATGCCA AAGCATTGAT TGAAAAAGAA TGTTTCGATG CCTTCCAATC 8 0760 

CTTAGAATAT GAAGTTAATA CACTACATAC TTCAAATGGG CAAACACCAT TTGTGACCTT 8 0820 

TGGTTTTGGT TTAGGTACAA GCTGGCAATC CCGCTTAATT CAGCAAGCTA TTTTAAAAAA 8 0880 

TCGGATTCGT GGACTTGGAA AAAATCATAA AACACCAGTA TTTCCAAAAC TTGTTTTCAC 80 940 

CATCAAAAAA GGTTTGAACC AAAATAAAGG CGATCCGAAT TACGATATCA AACAACTCGC 81000 

ATTAGAATGC GCATCAAAAC GTATGTATCC AGATATTCTC AATTACGATC AGGTTGTCAA B106 0 

AGTCACGGGC TCTTTTAAAG CCCCAATGGG ATGCCGTAGC TTTTTAGGTG CATACGAAGA 8112 0 

AAAAGGC C AC GAAATTCACG ATGGGCGTAA TAATCTTGGC GTGGTGAGTT TGAATTTACC 8118 0 

GCGTATTGCG TTGGAGTCCA AAAACGAAGA AGATTTCTAC CGCACTTTAG ATGAACGTTT 812 4 0 

GGCTATTGCG AAAAAAGCAT TAATGACGCG CATTGCCCGC CTTGAGAATA CCAAAGCTCG 81300 

CGTCGCTCCT ATTCTTTATA TGGAAGGTG C TTGCGGAGTG CGGTTAAAAG CGGATGAAAA 81360 

TGTGGCACAA ATTTTCAAAA ATGGACGTGC CTCTATTTCA CTTGGCTATA TCGGTATTCA 814 2 0 

CGAAACCATT AATG C ACT AT ATAACAAAGG ACATATTTTT GATGATGAAC AATTACGCGA 814 8 0 

AAAAGGCATT GCGATTGTTC GCCACCTTAG CGAGGCAGTG AAACGCTGGC AAAAAGAAAC 81540 

AGGTTATGC C TTCAGCCTTT ACTCCACGCC AAGCGAAAAC TTATGCGACC GCTTCTGCCG 816 0 0 

TCTAG ATAC C AAAAAATTTG GCGTAATTGA AGGCGTCACG GATAAAGGCT ACTACACGAA 81660 

CAGTTACCAC TTAGATGTAG AAAAGAAAGT GAATCCTTAC GATAAACTAG ATTTCGAAAT 8172 0 
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GACATATCCC CCACTCGCCA GCGGCGGTTT TATTTGCTAC GGCGAATATC CCAATATTCA 
ACACAACTTA AAAGCATTGG AAGATGTGTG GGATTATAGC TACGATCGCG TACCTTACTA 
CGG CACAAAC ACACCGATTG ACGAATGTTA TGAATGTGGT TTCACCGGCG AATTTGAATG 
TACCAGCAAA GGTTTTGTCT GCCCGAAATG TGGCAATCAC GACAGCACAA AAGTGTCTGT 
CACTCGCCGC GTCTGTGGCT ATCTAGGCAG TCCTGATGCC CGCCCATTTA ATGCGGGAAA 
ACAGGAAGAA GTGAAGAGAA GAGTTAAACA CCTTTGATTA ACACGATAGC AAAAGCACGC 
ATTTTTTAAT GCGTGCTTTT TGTTTCCAAC TTTGGCATTC CTAAAAATCT CCGCTAAAAT 
AACCGCACTT TTTCTTATCT TCTAGGAAAG CAATATGTCA GATATTCTCA ATAACCTTAT 
TCATCTTCTT AAACTTGAAA AAATTGATGA TTTAATCTTT CGTGGAGAAA GCCAAGATTT 
AGGTTTTCGT CAAGTATTTG GTGGGCAAGT TGTGGCTCAA G C ATT ATC AG CGGCAATGCA 
GGTGGCGCCA GAAGATCGTA TTTTACATTC TTGTCATGCC TATTTTCTTG CTCCGGGAGA 
TAGCCAATAT CCGATTATTT ACGATGTAGA AACCTTGCGT GAAGGGCGTA ACTTTTCTGC 
ATTATGCGTA AAAGCCATTC AACATAAAAA TACCATTTGT CATGTAACCG CTTCTTTCCA 
AGTGCCTGAA AAAGGTTTTG AACATCAAAA TACAATGCCA AATGTTGGTG CGCCTGAAGA 
CTTTACGGAT GAAAATGTGA TGCTACAAAA AGTAGCTCAA ACCTTGCCAG AACCACTCAA 
TGAAAAATTT G CAGC AGAGC GTCCATTTGA GGTGAGAACA AAATATTTGA ATAACCCATT 
TAACGGGACA AAATTGCCAG CCGAGCAATA TTCTTGGTTT AAAACCAATG GTGAAACGCC 
GCTTGATATA AAAATTCAGC AATGTTTATT AG CTT ATTTT TCCGATTTTC ACTGTATTTT 
GACCGCACTT CATCCGCACG AAAAAGG CTT TTTACAAAAA GGAATGAAAG TGGCAACCAT 
TGATCACAGT ATTTGGTTCC ATCGTCCGTT TGATTTAAAT CACTGGCATT TGCACGCAAT 
TGAAAGCAAT AATGCCTTTG GAGGTCGTGG TTTGGCCCAA GGGCAGATTT TCTCGCAAGA 
TGGTCAATTA ATCGCAACAA CACAACAAGA AGGCTTGATC CGTTTTAGCG AATAAGAGAG 
AAAAAATGAC CGCACTTATT ACACAATTTT GGCAAC AAG T TGAAACTGCG CTGAAAACAG 
CAAATCCACA AGAAAAATG T CGTTTAGTCA ATGATCTCTA CGATAATCTT CTGCCACAAA 
TTCAGTTGAT TAAATTAGAA GATTTTCCCG AAATCGTGCC GCAAGATAAT ATTGCCGCTT 
TTCCTGAAAA ACCGCTCTTA GTTGCACCGA AAGATGTGCC AAAACGTTCT TTTGCCACAG 
AAGAAGGCTA TGCTGCCACA TTACACGCTA TCGCACATAT TGAGTTTAAT GCCATTAATT 
TAGGCTTAGA TGCGGCTTGG CGATTCGGGC GTAATGCACA AGAAGAACTT GGCGAAGGTT 
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81780 

81840 

81900 

81960 

82020 

82080 

82140 

82200 

82260 

82320 

82380 

82440 

82500 

82560 
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TAGCTTTTGT GAAAGATTGG CTGCGTGTCG CTCGTGAAGA AAG CACG CAT TTTTCCTTAG 8 3460 

TCAATGAACA TTTGAAAACA TTGGGTTATC AATATGGTGA TTTTGAAGCT CATGCTGGAT 8 3 520 

TATGGGAAAT GGCGCAAGCC ACTGCTCACG ATATTTGGGA ACGAATGGCG TTAGTTCCTC 8 3 580 

GTGTATTAGA AGCAAGAGGC CTCGATGCAA CACCTGTTCT GCAAGAGAAA ATCG C ACAAC 83640 

GTAAAGATTT TGCCGCAGTG AACATTTTGG ATATTATTTT GCGCGATGAA ATTGGCCATG 83 700 

TGTATATCGG CAATCATTGG TATCACGCCC TTTCTAAAAA ACGGGGTTTA GATGCGATGA 83 760 

AATGTTTTAC AG AG CTGTTA CACAAATATC GTATTGTAAT TTTTAAAGGT GTCATTAATA 83 820 

CCGATGCTCG CATTCAAGCA GGTTTCACTC AGCATGAGTT GGATTGGATT TATGAAGTGG 83880 

AACAAACATT AAAATCCTAT ATTAAGAAAT GATGAAAAAA TAAAGAGACT AAGTTCGATT 83 940 

GAATACCGAA CTTAGTCCTT TTAATGAAAG TGATTTAACG TTTGAATTTA TTG CTTACGC 84 000 

CAAGTTGTCC CATTCGGCCC ATCTTCTAAT ACGATGCCCA TAGCTGTCAA TTCATTACGT 84 06 0 

GCTGAATCTG CGGCAGACCA ATCTTTAGCA GTGCGGGCTT CGTTGCGTTG TTTGATGAGT 8412 0 

GCCTCAATTT TTGC C ACTTC ATCATCGTTA GAACCTGCTT GTAAGAATTT CTCTGGTTCT 8418 0 

TGTTGAAGTA ATCCTAAAAT CGCCCCCAAT TCACG CAAAC GTGCAGCAAG TCCATTAGCT 8 4240 

TTCTCAACAT CTTCTGTTTT TAACTTATTA ATTTCACGTG CCATTTCGAA TAAAACAGAA 84 3 00 

AGTGCATTTG GCGTATTAAA ATCATCATCC ATTGCTTCAC GGAAAGTTGC AACAAAATTT 8436 0 

TCTCCGCCAA AAGCAACCGC ACTTTGATCT GTTCCGCGTA AAGCCGTGTA TAAACGTTCC 8442 0 

AACGCACCTT GTgCTAAATT CAAGTTTTCT TCGCTATAAT TGAGCTGGCT ACGATAATGT 8 4480 

GCCGTTAATA AGAAATAACG CACCGCTTCT G CATTGTAG T GGTTTAATAC ATCACGAATT 8 4 540 

GTGAAGAAAT TGCCGAGGGA TTTCGACATT TTTTCTTTAT CCACCATAAT CATGCCAGAA 846 00 

TGGATCCAAT AATTCACATA TTGGCCGCCA TGCGCACAGC AAGATTGTGC AATTTCATTT 8 4660 

TCGTGGTGAG GGAACATTAA ATCAGAGCCA CCACCGTGAA TATCAAAATA TTCGCCAAGC 8 4 720 

TGTTTGCAGT TCATTGCAGA ACATTCAATA TGCCAACCCG GACGACCTGC CCCCCAAGGG 84 78 0 

GAAGCCCAAC TTGGTTCGTT TTCTTTCGAC ATTTTCCAAA GCACGAAATC CATTGGATTT 8 484 0 

TTCTTGATTT CATTGATTTC AATACGTGCG CCAGCTTGTA ATTGCTCAAG ATCTTGACGG 84 9 00 

GATAATTTGC CGTATTCTTT GAAGCTTTCT ACATCAAACA TCACGTCCCC ATTATCCGCA 84 960 

ACATAAGCAT GTCCGCGTTT AATTAATTTT TCCACAATTT CAATAATTTC TGGAATATGA 8 5 020 

TGCGTCGCAC GAGGTTCAAA ATCAGGGCGT AAGACATTAA GTG CATC AAA ATCTTTATAC 8 5080 

-77.52- 

SUBSTITUTE SHEET (RULE 26) 



BNSDOCIO. <WO 9633276A1 1 > 



WO 96/33276 PCT/US96/05320 

ATTTCCTGCA CCATGCGATC CACTAACTGA TCGCAGGTTT CTTTGTTTTC TAACGCACGT 8 514 0 

TTAATAATTT TGTCATCTAC GTCCGTTATA TTACGCACAT AAGTTAAATC GTAACCCAAA 8 5200 

GAACGTAAAT AACGGGCAAT CACATCAAAA CACACAAAAG TGCGTCCATG GCCGATATGG 8 52 60 

CACAAATCAT AAACAGTGAC GCCGCAAACA TACATTCCCA CTTTATTTTC GTGGATAGGT 8 5320 

TTAAAGATTT CTTTTTCTCT CGTTAAGGTA TTGAAAATTT TTAGCATTTT TAAGCTCGTA 8 5380 

AAATGAAATA AAAACAAACC GCACTTTTAT AGCACTTTTA CCTCGATTGT GGAATAATAC 85440 

AACCCTTAAT TTTTTAGAGG GAAAACAAAA TGGTTACATT ACACACAAAC TTTGG cGACA 8 5500 

TTAAAATTAA ATTGGATTTT GATAAAGCTC CAGTTACAGC AGAAAACTTC TTAAATTACT 85560 

GCAAAGACGG TTTTTATAAT AACACAATTT TTCATCGTGT TATTGATGGC TTTATGATTC 8 5620 

AAGGTGGCGG TATGGAAAGC GGTATGCGCG AAAAAGCAAC TAAAGCCCCA ATCCAAAATG 8 5680 

AAGCGAACAA TCGTTTAAGT AACAAACGTG GAACCATTGC AATGGCTCGC ACTTCAGATC 8 5740 

CACATTCAGC GACAGCACAA TTTTTCATTA ACGTGGCAGA CAATGATTTC TTAAATTATC 8 5800 

GTTCAAAAGA AATGTTTGGT CGCGAAGTCG TTCAAGAATG GGGCTATGCA GTATTTGGCG 8 5860 

AAGTGGTTGA AGGTATGGAT GTTGTGGATA AAATCAAAAA AGTGAAAACA GGCAACAAAG 8 592 0 

GCTTTCATCA AGATGTTCCA ACTGAAGATG TAGTGATTAC ATCGGTTTCT ATTGAGTAAC 8 5980 

AAATG AG CAT AAAACAACAT CGCCCTAAAG TTGGAGAAAT TCTAGAATGT GATTATGGTC 86 040 

AATTTTCGCA TACCTGTCAC GTTGATGGAC ATATTCCACC TGAAATGGTT AAAAAACGTT 86100 

TGGTTGTTGT TTTGAATGCT AAACTGAATG GGCTTATTTT AGTTGCACCA ATTTCATCTA 8616 0 

AAATTAATTT AGATGGTATA AAAAATGGCT ATCATATTGA AAT AG ATT C A GAATTAATAA 8 6220 

AAGCAACAGG CTTTTATGAT AAAAGGACTC GCTGGATTAA ATCTGAATTA ATTCAATCAG 8628 0 

TAAGTCGTCT CCGTCTGTAT CATATTTATG ATAAAGGAAC TAAAATTACT CAATATTTGC 8 6340 

CAAGGGATGT TGTAACAAAA GTTCAAAGAG CTATCATAAA AGCAATCAAT GCAAGCTCAT 86400 

TACTTGACAA GTAGATAGAA TACAGTATAG ACTTCCGAGC GTAACGCCTT AGAG CGATCA 864 60 

TCATCGAAAG ATGAGTTTTT AAACG CC ATT GAGCGATGAA TCCCCTGATT AG CG AT AATT 86 520 

AGGGGATTTT TTATAGAATT TAGATAAAGA TATTTATGCA CTTCTTCGAC ACTCACACGC 86 58 0 

ATCTTAATTA TCTCCAACAA TTTACTGGGG AACCGTTGTC ACAGCTTATT GATAATG CTA 86640 

AGCAAGCCGA TGTGCAAAAA ATACTGGTTG TGGCGGTGAA AGAAGCTGAT TTTAAAACAA 86 700 

TCCAAAATAT GACCGCACTT TTTCCTGATA ATTTGTGCTA TGGGCTTGGG CTCCATCCTC 8676 0 
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TTTATATTCA AGAACACGCC GAAAATGATT TGATTCTGTT AG AACAAG C C TTAAAAAATC 868 20 

GTGATACAAA TTGCACGGCA GTGGCAGAAA TTGGTTTGGA ACGCGCGATT CCCGATTTGC 8688 0 

TCACCGATGA ACTTTGGGCA AAACAATGTC ATTTTTTTGA AAGCCAACTT TATTTGGCAA 86 940 

AACAATTTAA TTTACCCGTC AATATTCACA GTCGAAAAAC CCATGATCAA ATTTTTACTT 87 000 

TTTTAAAACG TATTCCATTA TCTAAACTTG GCGTGGTACA TGGTTTTTCA GGAAGTTACG 87 060 

ATCAAGCAAA ACGCTTTGTT GATTTAGGCT ATAAAATCGG AGTTGGCGGC ACTATCACTT 8712 0 

ATGAACGAGC CAATAAAACT CGTCAAGCAA TCGCGAAGTT GCCGCTTGAT GCCTTAGTGT 8718 0 

TGGAAACAGA CAGCCCAGAT ATGCCTGTAT TTGGTTTTCA AGGCCAACCT AATCG AC CAG 8724 0 

AACGTATTGT GGAAAGTTTT AAAGCTCTTT GCACCTTAAG AAATGAACCT GCTGAATTGA 873 00 

TTAAAAAACT TACATGGGAA AATGCTTGCC AGATTTTTTC TTGATCCACG T C AC ATT ACT 8 7360 

GAAATTATTT TGAATTAAAT ACTAGAAAAA AGAAACTGGC ATAATAATAT CCGCCTCGTA 87420 

GAAAGTTTTT CTCAACACAC AATAAGGATA TATTATGTCG CAAATCTATA CTCAGGTGAG 87480 

AAAGTTAGCG GAACAGTTTA TTGAGCCTTA CGCCGACGAA CTCGATCAAA AAAAGAGGCA 87 540 

TTTAAGGAAT TAGGTAAAGC TGGTTGGTTT TCACTTTTAA TTCCAAAAGA ATACGGCAGA 8 76 00 

ATGGGATTAA CCCTAAAAGA ACACGCTGAA GTCTGTATGG C ATT AG CTG A AAGTTCAGCC 87660 

TCTGCTGGTC TCTGTTATAT GATGAGTAAC GTTG CAGTG A ATTGTTTAAA TCTATTCGGC 8 7 720 

AGCTATCAAC TCAAACAAAA AATCTTTTCT GACATTGTTC AAAATAAAAC CTTTGCCGCA 87780 

CTCGCCTATA GTGAATTAGG CACGGGTACT CACTTCTATT CGAGTTTTTA CATCAACATG 8 7840 

GCTTGGAATT TGGTCAAGAT TATGTAGAAA AACGCTTTGA TGTGGGTGTT GGTTTGCACG 8 7 900 

GTGATCATGT AGGCGGTGAA TTAAATTCAG CCAATGCATT CACCGAAACC TTGTTCAAAA 8 7 960 

TGGATTACAA CAATCCAGAA CATAAAGAAA TGATGGATTT GGAAGGATTA AAACGTTGGA 8 802 0 

TTGCGCGCAG AAAAAGCCTT AAATTACCTT CCACCAGAGC AAATATCAAA ATTTCAGACA 8 8 080 

AAAAATTGCC CCATTAAGGG G C AATCTTAT CAATGTTCTA AAGTTACTTA TAGCTTTAAA 8 814 0 

CAACGATTAA ATATGTTGGT TAATGAAGTT AG C C AATTG A GTTTTTGGTA ATGCACCAAC 8 8 200 

TTGAGTTGCA ACAACTTGAC CATTTTTAAT CAATAATAAT GTTGGGATAC TACGAACACC 8 8 260 

AAATTGTGCT GGCGTTGCTT GGTTATCATC CACATTCATT TTCACGATTT TAACTTTACC 8 8320 

TGCAAATTCA GGTG CAAGTT CATCTAACAC TGGTGCAATC ATTTTGCAAG GACCACACCA 88 380 

AGGTGCCCAA AAGTCTAATA AAATGGGGAT ATCTGAATTA ACTACAACAC TTTCAAAATC 8 844 0 
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TGCGTCATTA ATGTGTAATA CTTCGCTCAT TTTGTTTTCC TTATTTTGTG AATGGAATCC 8 8 500 

CTATATAAGG GCTTTTTTTA AACTTACAAG TGCGGTGCAG AATAACACCG TTTTAAAAAA 8 8560 

TGTAAACTAA TCG TTTTAAT TGCCGAAAAC T AAGACGG C A AAACAATATT TTCACACACT 8 8620 

TTAC CCGATT TCAATTTATA AATGTTTGAT AAAGTCACGT CTGCAATATT GGTTAGTGCT 8 8680 

TCTTCGGTTA AAAACG CTTG GTGACCTGTT AGCAATACAT TATGGCAAGA AGATAAACGG 8 8 740 

CGGAAAATAT CATCTTGAAT CACTTCGTTA GACTTATCTT CAAAGAATAA ATCACGCTCA 88 800 

TTCTCGTAAA CATCCATACC TAACGCACCG ATTTTACGCT GTTTTAACGC ATCAATCGCA 88860 

GCTTGAGTAT CAATCAAAGA ACCGCGACTC GTATTAACGA TCATCACGCC ATCTTTCATT 88 920 

TTTGCAAAAG CCTCGCAATT TAATAAATGA TAGTTTTCTG GCGTTGCTGG GCAATGAAGC 8 8980 

GTAATCACAT GAGATTTAGC ATAAAGTTCA TCTAATTCAA CATATTGTCC CCCCAGTTCT 8 904 0 

TCTACTACTG GATTTTTGAA AGGATCGTAC GCCAAGATAT TCATGCCAAA ACCTTTCAAA 8 9100 

ATTCGCATCA CTGCGATCCC GATTTTTCCT GTTCCGATTA CGCCGACTGT GCGTCCGTAC 8 9160 

ATATTGAAAC CAATTAACCC CTCAAGAGAG AAATTCG CTT CTCGGGTACG CTGATAAGCA 8 9220 

CGATGAATTC GACGGTTCAG AGTCATCATC AAACCGATGG TATGTTCCGC CACCGCTTCA 8 9280 

GGCGAATAAG CTGGAACGCG GACAACTTGG ATGCCTAATT CTTGAGCTGC TTTTAAATCA 8 9340 

ACATTATTGA AACCGGCACA GCGCAAAGCC ACAATTTTCA CACCAAGTGC GGCTAATTTT 8 94 00 

TCTAATACTT TTCGGCTACC ATTATCATTC ACGAAAATGC ATACTACTTC GCAATGTTCT 8 9460 

GCCAAACGCA CAGTGCTCTC GTTCAACATA AAATCAAAAA ATTCTAAATC GAAATTATAT 8 952 0 

TTCGCGTTAA TGAGCTCAAT ATATTTTCTG TCATAACTTT TAGTGCTATA GATTGCAATT 8 9580 

TTCATGGTTT GCTCCAATTA TTTTAAGTTT GCAAAAGCTT GTAATAAGTC CGCTTTAATA 8 3640 

TCTTCTACAT CTTCAATACC GACTGAAAAA CGCAACAAGG TATTAGTAAT CCCACGAGCT 8 9700 

ACACGCTCGG ATTCTGGAAT ATCCATATGA GTTTGAGTGG CAGGATAAGT AATAAAACTT 8 976 0 

TCTGTGCCGC CTAGACTTTC TGCAAAAGTA ATCAATTTGA TGGATTTTAA AAATGTATTA 8 9820 

ACCCAAGCTT CATCTTGCAA ACGGAAAGAC AACATCCCAC CTTTATTCGG ATAAAG CACC 8 9880 

GATTCCACTT GTGGTTGAGC TTTTAAAAAT TCCGCAATAG CTTGTGCATT TTCTTGATGG 8 9940 

CGTTTCATAC GCAAAGAAAG GGTCTTCATA C C ACGAAT AG TCAGC CAAGA ATCAAAAGGA 9 0000 

GAAAGCACCG CCCCTGCACC ATTTTGAATA TAAGCAATGC GATCACAAAG TTCTTGCCCT 90060 

TTCGCAACAA TTAAACCAAC AAGTGCATCA TTATGCCCAG CAATATATTT AGTTCCGCTG 9012 0 
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TGAATTACCA CATCCGCACC TAAATCTAAT GGACGAGAAA GCACAGGCGT TAAAAACGTA 90180 

TTATCCACGA TCAACATTAA GTTATGTTTT TTCGCAAGTT TTGCAATTTC AACCACATCA 9024 0 

CATTCTTCCA TTAAAGGATT CGATGGGGTT TCAATAAAAA TCGCTTTCGT ATTTGGGTTA 903 00 

ATGGCAGCCT CAATTGCTGA AGCAGATGCT GTATTAACAT AAACAGGTTT TACACTATTG 90360 

TTATTTTTAT AAGAAAAATC TAATAATCGA TAAGTACCAC CATACACATC GCTAGAAACA 90420 

ATCCATTCAT CTGGGGCGGT AAAAAGCGTC ATCAAAACTT GAATTGCAGC CAT AC C AGAA 904 8 0 

GAAAACGCAA AACCACGATC TCCATTCTCT AATTTAG CT A TCGTTTCTTC TAATACTGTG 90540 

CGAGTTGGAT TTTTGGTACG TGTGTAATCA AATCCCGTAC TTTCACCGAT ACCACAATGA 90600 

CCATAAGCCG TAGAAAGAAA AATTGGTGCA GAAACCGCGC CCGTACGCTC ATCACTGCGA 9 0660 

TTGCCTGCTT GAGCTAATAA AGTATCGATA GCATATTGTT GTGTCATAAC ATCCTCTAAA 90720 

AATAATGAAA AAATTGACCG CACTTTACGT GTTAAAAAAC GTTTAGCATT TTGGCACAAA 9 0780 

GGAAAAAGTG ATAAAAGTAG AAATTAGAAA ATTACATAAA ATTTCATTTT TCCATCATTT 90840 

TCTGTGATTT ATTTGAACAA AATGAACTTT TTTTAAACGG TTAGACAATT TTTTTAAATC 90900 

TGTAATTGAC AGTTATCTGA ATTTCCGTAA AATGCGCACC AATAACAGCG AATGCGGGAA 90960 

TAGCTCAGTT GGTAGAGCAC GACCTTGCCA AGGTCGGGGT CGCGAGTTCG AGCCTCGTTT 91020 

CCCGCTCCAA TTCGTGGCGT G TT AG CAAAG CGGTTATGCA CTGGATTGCA AATCCATGTA 91080 

GCTCGGTTCG ACTCCGGGAC ACGCCTCCAT AATACGCAAA TGTTAGTTGT CTATATGCCC 9114 0 

GAGTGGTGGA ATCGGTAGAC ACAAGGGATT TAAAATCCCT CGCCTTTCGA GGCGTGCCAG 912 00 

TTCAAGTCTG GCTTCGGGCA CCATTTAAAC ACCAATCCAA AGATTTTAAA ATTTACTTTA 9126 0 

AAAT CATCTA ATTGGGTCGT TAGCTCAGTC GGTAGAGCAG CGGACTTTTA ATCCGTTGGT 913 2 0 

CGAAGGTTCG AATCCTTCAC GACCCACCAC TTAAAAATAA AGTAAGCACC GAAAGGTGTT 913 8 0 

TTTTTATTGC CTAAATCTGA TACTACTTTT TAAATTACTT GCTTATTGAA AAAT AG CTAT 9144 0 

TCTCTCCATC CAATCTAAAA TAATTAATAT ATTTTATCGC TACGTTATAC TTTCTAAATA 9150 0 

AAATAGCACC TGAAAATCAG ATGCTATCCA AAACACAATA AAAATTTACC GCACTTTATG 91560 

ACTTCAATAA GTAAGCCCGT AGTTGTGCAA AATCATTATC CATTTCATCT GAAAGCAATG 9162 0 

GTAGCTGATT ATGTTTATCC AGTGTTTCAG GTAATGGAAG CTGAATACCA AGAATACGTT 9168 0 

CTACGGATTC TTTAAATTTC GCAGGATGCG CCGTACACAG GAAGATTCCT GTTTCACTCG 9174 0 

CTTTAAGTTG ATCTTTCAAC ACCTGATAAG CTATCGCACC GTGAGGTTCA CATAAATAAC 91800 
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AAGGTATCTT CTGTTTCGCT ATCACTTAGC ATCCCCGAAC 9186 0 

CCATTTCGTT TAAACAATTC CTCCACTCTT GGCCAATTAT 91920 

G C ATT AG AAA GGGTTGCAAC TGTCGTTTTA GGATCCCAAT 9198 0 

GGCACAGTAT CATTTGCGTT TGTTGAGGCG ACAAAGCGTT 9204 0 

TTTG CAATTA ATCCCGCAGT T AAGTT AC C A AAATTACCAC 92100 

TTATCACGTT TTTCTTTTGG CAATTGTGCG ACAGCTTCAA 92160 

AATAAACGGC TGATATTAAT AGAATTTG CA GAGTTAAGCC 9222 0 

ACATCATCAA AGGCTTGCTT AACCAAAGCC TGACACGCAT 922 8 0 

ACAGTACGAA TATTCCCACC TAAAGTACAG AACAGCTTTT 92 34 0 

CCTTTTGGAT AAAGGATAAC TACATTAATA TTTTCTAATC 92400 

GCGGCTCCAG TATCGCCAGA CGTAGCCGTT AAAATAGTAA 9246 0 

GCAGCAAGTG CCTGTGCCAT AAAACGCCCA CCAAAATCTT 92 52 0 

TGAAATAATT CAAGTGCATA AATATTTTCT TCCACTTTTT 92 58 0 

AATGCATTTT TCACCATCGC ATCTAACGTT GCTTGTGGTA 9264 0 

CCTAGAATTT TTTGACTACG CTCAACAAGA GGCAATTCAA 92 700 

AGTTGAGGAA TAACTTCTGG AAAGAATAGA CCTTGATCCC 92760 

GCCTGAGAAA AAGTGACTTG TTCTTCTGGG TGTTTGATAT 9282 0 

CCGTTTTTAA TCTGATTTCA TCTATTTAAA TTAACTTATT 92 88 0 

TTATCAACTT TACATACATG CACAAAACCT TCATTATTTT 92 940 

TAACTTGAAA GTTTAATTGC AGTTTGTAAA TCAGGGGCAA 93 00 0 

GAGCCTGAAA TACCTGTGGC AAGTGCGCCT AAGTCTCGAG 93060 

AAATTAGGTA ACAGAGATTC ACGATAAGGC T C GG C AATG A 9312 0 

GCAAGATTTT CCTGATGGGT ATGGCAAGCA TGCACAAAGC 93180 

GCAATCACGT TTTGTCGAGT ATAGCTTTTC GGTAAAATTG 9324 0 

ACTTC AATTC CCGGATAGGC TAACACCCAA TACCAATTAT 933 0 0 

CAAATATTGC CGAGCGATTG CACCATAAAT TGCACACCGC 9336 0 

TTATCATAAT GGATTGAACC AGAAATACGA CCTTCCAATT 934 2 0 

TCCATTTTCG AAAAAGGTTC ATTATGAAAC TGATTTAATG 934 8 0 
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CCACTAATGC AG CAACAAT A GAACAAGCAC TAGAGCCTAA ACCAGAGCCA ATAGGCATAT 9 3 540 

TTTTTTCGAG CGTTAAACGT AAAGGTTTAA CATTTGCTCC GCGTAATTTT AATTGCTCGC 936 00 

TAAATAGCAC ATAAGCTTGG TAAACAATAT TTTTTTGCGG TTCTTTCGGT AATTTACGAA 93660 

CAAAATAGCC CGCACTTTCT AACTCAAAAC CAGTTGAAAT AGATTCTATT TGAACCACAT 93 720 

CGCCTAACAA AGAACCATCA ATAGGAGAGA TCGCAGCACC TAAAGTATCG AAACCTACGC 9 3 780 

TAATATTCGC GCTAGATGCA GGGGCGTAAA TTCTTAACAT TGGGAATCCT TATTAATGTT 93 84 0 

GTAATGTTCT TAAAATATCT GCAAAGATAC CTGCCGCAGT AACTGCATTA CCTGCGCCAT 93 900 

AACCACGTAA TAAAAGTGGA ATCGGCTGAT AATAGCGAGT GTAAAATGCC AAGGCATTTT 93 960 

CACCATCTTT CACTTTATAA AGTGGATTAT TCAGATCTAC CGCAACGATG GATACTTTGC 9402 0 

ATTTTCCTTC ACTAATTTTT CCAACATAAC GCAATACTTT GCCTTCTGCT TTAGCTGTGG 94 08 0 

CAACGCGAGT TTTAAATTCT TCATCAAGCT GAGGCAACAT TGCCATAAAC TCATCTGCAG 9414 0 

ATTTTCCATC GGAGAAGCCT TTTGGCAATA CACCTTCAAC TTCAACATCT GAAAGTTCAA 942 00 

GTTCAATTCC CGCTTCACGA GCAAGAATTA ATAATTTGCG TGCTACATCT TGTCCAGAAA 9426 0 

GATCATCTCG AGGATCTGGC TCAGT AAAG C CTTTTTCACG CGCAAGTGCG GTTACTTCTG 94 320 

AAAGAGAAAG TCCTTCTTCC AACTTACCAA AAATAAAGGA AAGCGAGCCA G AT AAAATC C 9438 0 

CTTCAAAATA TTCAAGCTCA TCACCTGCTG CAAGTAAGTT TTGGAGATTT TCAATAACTG 9444 0 

GCAAGCCCGC ACCTACATTG GTTTCATATA AAAATTTATG CTGGCTTGCT TGGGCATTTT 94 500 

GGCGTAACTC ATTGTAATAA ACTAATTCTC GTGTATTCGC TTTTTTGTTT GGCGTAACCA 94 560 

CATGGAAACC TTCTTTTAAC GCACGAGCAT AAAGCCCTGC TACAGATTCG GCTGAAGTAC 94 62 0 

AATCTACAAA TACTGGATTC ACCACATGAT GTAATTTAAT GAAAGAAAGC AATACATCAA 94680 

AATCTGATGG CTGAGTCGCA TTTTCTAAAT CGTTTTTCCA ATCCTCTAAA TTT AATC CAT 94 74 0 

TTTCATCAAG CAACATACGG TTGGAATTCG CAATAGCACA AACTCGAATT TCAACATTCT 94 8 00 

TCTTTGCTAG ATACTCCTTT TGACGCTTTA CCTGCTCAAT TAATTCTCCG CCTACTCCAC 94 860 

CAACGCCCAC TAAAAACATA TCCACGACTT TTTTATTATT GAAAAGCGCC TGATGGGTAG 94 92 0 

CCTTCACCGC TTCAATGGCT TTATTTTGCG GCACAACCGC AGAAATTGAA CGTTCGCTAG 94 980 

ATCCTTGTGC AATCGCCACA ATGCTAATGT TTGCCTGCGC TAATGCCGAG AAGAAACGAG 9 504 0 

CAGCAATGCC TTTCGCTTGC TTCATTCCAT CGCCCACAAC AGAAATAATG GATAAGTCTT 95100 

TAATCACTTC AATTGGTTCT AATTGATGTT CATTTAATTC ATTAGCAAAC TCCGTTTCTA 9516 0 
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ATCGGCCCAT TACAATCGAA TAAAACCCGT CTCGTTGCAG AACATTTTGA TTGGATGCAA 96900 
ACGCTTGATC GAGCAAAAAT TGCAGATCGT TTAAATGAAC AACGACCGAC CAACAAGGCT 96960 
C CATTG AATG TATTGATTCA GATTAATATC AGCGATGAAG AAAGTAAATC AGGTATTCAA 
CCTGAGGAAA TGCTAACACT AG CAAAACAC ATCGAAAATT TACCGCACTT ATGCCTACGT 
GGCTTAATGG CAATACCCGC ACCAACCGAC AACATCGCAG AACAAGAAAA TGCATTTAGA 
AAGATGTTGG AGCTGTTTGA ACAACTTAAA CAAGTCTTAC CCAATCAACA AATTGATACA 
CTTTCTATGG GAATGACGGA TGATATGCCG AGCGCAATAA AATGCGGTTC TACAATGGTA 
CGCATTGGCA CTGCGATTTT TGGTGCGAGA AATTATTCAA CAT C AC AAAA TAAATAATCA 
TGGAATAAGG CCCGAATTTA AAACAAGCCT TATTTCTTAA TGAAATTTAA ATGCCAGTAC 
TCTCGCCACA TTTTGAGCGG TAGAAATTAA ATTTTGTTCG CCTTGTTTGA GAATATCCGA 
TAAATCGCCA AGTTGGTGAA TAATAGGGAA CACCGCATCA ATACCGTGAT CAAACACCAC 
ATCATAATCT TCACGCAAAC ACCCTGCGAT GGCGATCACT GGCTTATTGA ACTGTTTTGC 
CGTACGTGCC ACACCAATTG GCGTTTTACC CATAATACTT TGTG CATCTA TTCGCCCTTC 
ACCCGTAATC ACTACATCAG CATCTTTTAC ATAGTCAATT AAATGTAACC GGTCTAACAC 
AATTTGTATG CCTG CTTTC A GTTGTACGCT TGGTAATAAT AGCAAGCCAC CACCCATACC 
ACCAGCAGCA CCTGCGCCAG CCTGATCACG AATTTGTTTA CCGCAATCGC GTTCGGCAAT 
TTCAGCAAAA TGCGAAAGTG CAGCATCAAG TTG TTTCACC ATTTCAGGCG TTGCGCCTTT 
TTGTGGTCCA AAAATGG CAG ATGCACCACA TTCACCACAG AGCGGATTAT TCACATCGCA 
CGCTACTTCA ATGTGAACTT GTTGCAAACG TGGATCTAAT TGTTGAACGT CAATCTTCAC 
AATATTTGCT AGCTCTGCTC CACCAAAACC GATAGGCTTA TCTTGCGAAT CTAAACATTG 
CATCCCTAGT GCTTGCAACA TTCCTACACC ACCATCGTTG GTTGCGCTAC CGCCAATGCC 
AAGAATGAAA CTCTCTACGC CTAAATCTAA TGCCAGTTTA ATCAACTCCC CTGTG CCGTA 
ACTGGTGGTA AGCAAGGGAT TGCGTTTTTC AGGCGGAACA AGATGTAAAC CAGAAGCTGC 
TGCCATTTCG ATAATCGCGG TTTTTCCATC TCCCGATAAA CCGAAGAAAC TTTTCACTTT 
GTCACCCAAA GGCGCGGTGA CTTCACATTC AATGAGCTTG CCTTGCGTGG CATCCACAAG 
GGATTGAACT GTACCTTCGC CCCCATCTGC CATTGGCAAT TTTAC AT AG T CCGCATCGGG 
GAATACACGT TTG AAG CCTG TTTCAATGGC GGTAGCGACT TCAAGTGCGG TCAAACTTTC 
TTTAAAAGAA TCAGGTGCAA TCACAAATTT CATAACAACT CCTAGAATTT AAATACG CCA 
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AAAATTAAGG TAGAAACGAT GGTCATCATC AAACCGACCG CACTTTCATA AGGAATAAGT 98 58 0 

TTTAAACGTT CTTTGATAT C CATATTCACA CTACCGCCTG TGGCATGGAA GAAAGAACCA 98640 

TGTGGCATAT GATCAAATAC GGTCGCCCCT GCATGAATCA TTGCTGCACC AGCAAGACTG 98 700 

CTAACTCCAA GTTCTAACAA GGTACTACTG AACACATTGG ATGCCACCGC AGTACCTGCC 98760 

GTGGTAGAAG CTGTCGCTAA AGACATCAAC ACCCCTGAAA TTGGTGCAAG AATGTATGAT 98820 

GGCAAACCTG AATGTTCCAA ACCTTGAATC AATACTTCTT TTAAACCAGA ATTAGCAATG 98880 

ATGCCAGCAA GCGCGCCTGT ACCAAGTAAC ATAATTG C CA CGGGTGTCAT TTTACTCAAT 9 8 940 

CCATTGATAG CATAACTATT TATATTGCGA AGTTTGCCCA TACAAAACGC ACCAATCAAA 9 9000 

CCACCTAACG GCAATGCAAT TAATGGATCA ACTTTAATAT CAAACAATGG ACGAAGTGCC 9 9060 

AATAGCAAAA TTGCCACTAA AGGTGCGACA AGTGCGGTCA GAAATGAAGG TAAATTTTGC 99120 

GTTTCAAGCA CGATTACCTC TTTGTCAGTC ACTTTAGAAC CTTTGTTGAT TAAGCGTTTT 9 9180 

GCTAAGAAGT AGGTGAGAAT CAAACCAAAT AGCGCGGGAA TTATACCAGC CATCATCACG 9 9240 

GAAGTGAGTG GCAAATGAAA AGTATCTGCT GCCGCAATGG CATTTGGGTT TGGCGACATA 99300 

AT ATT AC C AG CTTTACCACC GCCAATCATG GCGAGCAAAA TTGCAGCTTT CGATAAATCG 9 9360 

CTGCGACGCG ATAACGCTAA AG CAATTGGT GAAACAGTAA TCACTGCCAC ATCCACGAAC 9 9420 

ACACCCACCG CAGTTAAAAT CATCGTCGCT AGTGCCAAGG CGAGTAAAGC TCGAGTTTCA 9 9480 

CCCAATTTAT TAGTAATGGT TTCCGTGATG GAATTTGCTG CACCCGATTC AATCAGCACA 9 9540 

CCGGCCAACA CACCAGCAGC TAAGATACGC ATAACCGCAG TAGTAATACC CTGTGCGCCA 99600 

CCGATCATTA GGCTCACTGT TTGCGATAAA TCCGCACCAC CAATTAAGCC TCCAACCAAA 9966 0 

GCTCCCACTA ACATACCATA AGCTGGTGAG ACTTTTTTCA AAATGAGGAA GATAGCAACG 9 9720 

ATTAACGCCA CCAACGCACC TATTGCTGAG ACTGTTGTCA TTTTTTCTCT CCTATAAAAA 9978 0 

TTAAAAC CAA GGAGATTATT TCTGTTTTAT AGGGGAAATG CTTTATTAGA GCAAACAAAA 9 984 0 

AGTGCGGTTA AAAATCATCA AACTTTGTAA AGTTATACAA CCTCAATGTT CTAACAACGT 9 9900 

TCCGAGATAG AGCGTTAATT TATCATCTAT CTTATTGAAA AATAAGCCAG TTATTTGCTC 9 9 960 

TATTTTGTTC AATCGATAAC GCAAAGTATT CGGGTGAACA AATAATTTTT CCGCAGTGAG 10 0020 

ATAAAGATCA CAATTTGATA AAAAATATTG TTGAAGGGTT TTATAAAGTA TCGCATTTyC 100080 

TTCTGAGAAA nGGGGAGAAA GGGGTTTTAT CAATTCGTTC CCTTGCCAAG AATGGGATAA 10014 0 

TCCAGCAAGT AACACAGGCA ATCGATGCTC ATCAAACACA TAAATACCTT TGCGTGGATG 10 02 0 0 
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100380 


GACCACAACC 


TGATCAAGGG 


ACAAAATTGC 


CACATCCTGT 


G C AAATTC AG 


ACTGTTCCAA 


100440 


ATAATTGATC 


AGATTCTGTA 


AATTATCCAG 


GGCTGGATTA 


AGTAACTTAA 


TCAATACAAC 


100500 


CACACGAGAT 


TTATTTAAAT 


CAAAAGAAAA 


AAATTTCGCC 


TGCTGTTCCA 


TCTCTTTCCA 


100560 


ATTCAAATTG 


CAATGTAATA 


ATTGTAAAAT 


AAATTCTTCT 


TTGTAACGAC 


GATGCCAGCT 


100620 


TTCTTGTTCG 


AGTAAAGCCT 


GTTGTTCCAC 


AATAAGTTCT 


GCCGTCATTT 


TCACTAGTTC 


100680 


GGCATATTGT 


TTCACCTGAG 


TTGGCTCACC 


AGAAATCCCC 


ACCACTCCAA 


TATTTTTGCC 


100740 


TAAATAATGA 


ATCGGTAAAT 


TAATCCCTGG 


TTGTGCTTCA 


AAATTC C ATT 


TTTGTGCCAA 


100800 


CGCTTGATCA 


ATCTCTACCA 


CACGATTTTC 


CCGCAACGCC 


AACACCGCTC 


CTGTGTGCCG 


100860 


TTGATTCAAA 


CGCGTTGAAT 


TTCCCGACGC 


AATGATCACA 


CCGTCGTGAT 


CCATTACATT 


100920 


T AC AGAGT G A 


TGAATGATCT 


TCATTGCACG 


TTTAACAATT 


TTTTTGGCAG 


TGTATTTGTC 


100980 


AAGTTGCATA 


TTTGTAATAA 


AGAAGATAAA 


TGAAGATAAA 


ATATTGAGAA 


AAAGAAAGTT 


101040 


AGTCAAATAT 


TGTTTTATCT 


GCAAGAAAAG 


ACGGGAGACT 


ATTCTCCCGC 


ACTGCGTTTT 


101100 


TATCG CTATA 


ATGTTTTATT 


CTGAACTTGA 


ACATATCCCA 


TAATTAATGC 


CAAAATACCT 


101160 


AGTGGAAAAA 


TGAGATCTAT 


TACAGAACCA 


TCATTTCCCC 


AAATCAAGAA 


TAGCTTTGAT 


101220 


GAAAATAATT 


AAAATCGGTA 


AGACAATAGG 


AATACTCACG 


ACATAGCCCG 


TAATGGCTAA 


101280 


TGCCCATTCT 


TCCTTTTTCT 


TAC CTAGAAA 


TTTGATAAAG 


CTGAACGCCA 


TTTTTTCAGC 


101340 


AACTGGGCAG 


ATTGAAGTTT 


CACCATGTAA 


AGGTTCTCGA 


ATAATGCTAC 


TTACAAAGGA 


101400 


AGAAGTCGCT 


TTATTTGGTT 


TTCCTGGACG 


GATACAAATT 


GTGGGCAGGC 


GAAGACAGAG 


101460 


ACCATCAACA 


AAACCTTTTC 


TTGTATAGTC 


ATTAATAAGT 


AATTCACTCA 


TTG CTTTTTG 


101520 


TGCACCATAA 


GTCGATTGTG 


GCGTTACAGC 


AGTAGAATCC 


TGAATAATTT 


CTGGTAAATC 


101580 


ACCACCAAAT 


ACAGCAAGTG 


AACTAGAAAA 


ATAAAACGAA 


TTTTAGTATC 


AG ATT CATC A 


101640 


AAATATGGGA 


AAGTGGCGAC 


ATTCCATATT 


TGTCCACTAA 


CAAATTTTGA 


TCACATAATT 


101700 


TGCGACAAAA 


ATCTTGTTGA 


ATCTG CAC AG 


AATGG C ATT A 


GAGAATTAAA 


TATTAATTTA 


101760 


ACTATCGTAT 


AAATAAATTA 


ATATAAATGA 


TCTGCACCCC 


AAAACTAAAC 


AAATAAGCTT 


101820 


ACTCATTAAG 


GTGCAAATCA 


AAGTGCGGTT 


CATTTTTATA 


TTATTTTTGG 


CTACATACAG 


101880 
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CAATAAAATG 


TA ATTTATr^T 
* x x x l \j X 


1 LAGbA 1 LA 1 


r AAAGG TTT T 


GAACATTCTT 


TTAAATTGCT 


101940 


CACGATTTTC 


^»vjv— x x icniL 


uUAl l-fU_.Vjtj.rt. 


*T^^^ A »W^ni 1 1< IM11 * 

1 bA r 1 TTT AA 


CGTACCGaAA 


ATTCCTTCAT 


102000 


CATAAATCAT 




/"•A A A r""T A A T'f"* 

bAAAb 1 AATG 


TCATCTCACC 


AGAGAAAGTA 


TCAACATTTC 


102060 






ATGCCTTTCC 


ATCCATCTTT 


CGTTAATGGC 


GTGACAGTCA 


102120 


v» w X X X \»v?v<_ 


111 UutaCATA 


1 |UI WHI W/ » ^ « » 

ll IT C T AG AA 


TAGTTTGATG 


ATCATTCCCC 


AC CAGCATAA 


102180 


1 vo i Vino 1 


AAta LAATAAA 


C CATT GGG TT 


TTAACACTCG 


AAAATATTCT 


GCAATGGCTT 


102240 


A X X Iv. 1 1 UuL 


X 1 UUAL-UKjij 1 


AACATTGTGA 


GCATCGCTTC 


ATTGATGACA 


ATATCAAAAC 


102300 


XXX v..#-v 1 1 L 


P* A A A /""• 1^ /T A A 


TTCATCGCAT 


TCGCACGCTG 


TACATGAATT 


TTTTCCTGCA 


102360 


nu^^Ai X X Vj l». 


1 1 LAATATTT 


GCTTGTGCTT 


TTGCTAACGC 


ATTTT CATCT 


AAATCAACAC 


102420 


\— X x X >V X VJ 


A c* a a A A A *T* 

ALAALLAAAT 


TGTTTCGCCA 


ATCCAATTGC 


AGTCGTCCCC 


ATATTACAGG 


102480 


CAAPPTC'PAA 


v-AL 1 T ITTTA 


TCTTGGCTAA 


AACCGCCATT 


AGCAATTAAC 


CAATCTGTCG 


102540 


v— X X X X X X V- 


<jv_.L IVjGACGC 


TV TV TV f*/"* /*^/ »' IX Mill 

AAACG CGTTT 


TACCCAAACG 


TGCAAGAAAA 


TTATGCCCTA 


102600 


V- A X v^nX V— X X X 


lvjt-V-Al 11X1 


ATCTCCTAGT 


AATTAACTTG 


GTTTTTATAT 


TATAGATAAA 


102660 


A A TCI A T A A 


A 1 1 M 1 ' A A A A *t+f**T> 

Al 1AAAATGT 


CAGATCAAAT 


AAATAnGGCG 


GAATAATTGT 


TCCGCCTTTT 


102720 


V-UXAl i>i 1 X 1 


u rrCAATGCT 


TTTTTCAATT 


CAAGTAATTC 


ACGACGTAAC 


TGTTTCAATT 


102780 


HjLI 1 it,v-AT 


TTCAG CTTG T 


TTATACAGTT 


TTTTGTCGGT 


TTCTGCTTGT 


TGAGAGCGCA 


102840 


x LA 1 LALTAC 


TTCTTGT C AC 


AATTGCAAAA 


CTTCCGCAGA 


ATTGACCGCA 


CTTTGTGTTG 


102900 


AAA PT'T A A «"P/"« 


TCTCAAGCGG 


TTTATTATC C 


AGATACGAAA 


TTTGTTGGTG 


TGGATTTAGC 


102960 


vjvaA X x LAA 


ATTACCCAAG 


GTAATGAAAT 


TATT CAATCA 


ATGGGCTTAA 


CAAACGTTCG 


103020 


111 UL. 1 IbAA 


TATTTTATTT 


ATCAAGTTGG 


tTCATTTACT 


ATTCAATCTT 


T AAC AC AG C A 


103080 


A TTP A /"* /~« A A 

1AI IbAGGAA 


AATAAAGAAT 


TTGCCAATAT 


TACTGAAAAT 


GAACTGTATT 


CTGCGGTGCT 


103140 


HiH x 1 A\j 1 1 


ATTTTAGGTT 


ATGTCTATGT 


TTATCTTACG 


ACCTATCCAA 


TTTATTCCTT 


103200 




AAAACTTACA 


TAC C G AAAAG 


CTTTACTCAA 


TATGTAAAAA 


CATTGGTAGA 


103260 


a nr; ccx n a a a t 


LAGTATATTG 


GTGCTGGAAA 


TATGTATAAC 


GGCGACGTAG 


AAGATTTGAA 


103320 


TAAACTGCAT 


CTTTATATAA 


TGTCTCAAAT 


GGAAAAACCG 


ACAACTAAAG 


CTGAATTGAA 


103380 


ATCAG CATT A 


CAAGGCTATT 


TAATTCAAAA 


TGAATATCAA 


GATATGAATA 


ACAATGATAA 


103440 


ATTGATTGAT 


GAAACTTATG 


ATTGCACTGA 


GTTGTTTAAT 


GCGCTTTTTG 


ATGTATTAAC 


103500 


AAGATTAGGT 


ATATCCAGCC 


TGTAAACCTA 


TAATCTGTGT 


ATGGTTCGTT 


TAATGTACCG 


103560 
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TTTTGTTTTT CATCTAAATA TCAATTTAAA TTGAGAACTA TTTGCATTTA AATAAAATTA 103620 

GGCATAGAAT AGCCACAATT CAGGCTGAAC TGAATAATTT G C ACTCAAC A TAAGGAGCTT 103680 

TTCAATGCAA TTTAAACATT TCAAACTTGC TACCCTTGCG GCaCACTTGC TTTTTCTGCT 10374 0 

AATAGTTTTG CTGATATTAC CGTTTATAAT GGTCAGCACA AAGAAGCCGC TACGGCTGTG 10 3800 

GCAAAAGCCT TTGAACAGGA AACAGGCATT AAAGTTACGC TAAATAGCGG GAAAAGTGAG 103 860 

CAACTTGCAG GTCAATTAAA AGAAGAAGGC GATAAAACAC CAGCCGATGT TTTCTACACT 103 92 0 

GAACAAACAG CGACTTTTGC CGATCTTTCT GAAGCAGGGC TTTTAGCACC AATTTCAGAA 103 98 0 

CAAACCATTC AACAAACCGC AC AAAAAGG C GTACCACTTG CACCGAAAAA AGACTGGATT 104040 

GCATTAAGTG GCCGTTCTCG AGTAGTGGTT TACGATCACA CTAAATTATC TGAAAAAGAT 104100 

ATGGAAAAAT CAGTGCTTGA TTATGCAACA C C AAAATGG A AAGGCAAAAT CGGTTATGTA 10416 0 

TCAACTTCTG GTGCGTTCTT AGAGCAAGTT GTTG CGTTAA GCAAAATGAA AGGGGACAAA 10422 0 

GTTGCGCTTA ATTGGTTAAA AGGTTTAAAA GAGAACGGAA AACTTTACGC TAAAAATAGT 104 28 0 

GTGGCATTAC AAGCGGTTGA AAATGGCGAA GTACCTgCTG CGTTAATCAA TAACTATTAT 104 34 0 

TGGTATAACC TAGCAAAAGA AAAAGGCGTG GAAAACCTAA AAAGCCGTCT TTATTTTGTT 1044 0 0 

CGCCACCAAG ATCCAGGTGC GTTAGTTAGC TATTCAGGTG CAGCAGTATT GAAAGCCTCT 104460 

AAAAATCAAG CTGAAGCACA AAAATTCGTT GATTTCTTAG CGAGTAAAAA AGGTCAAGAA 104 52 0 

GCATTAGTGG CAGCGCGTGC AGAATATCCG TTACGTGCTG ATGTGGTTTC GCCATTTAAT 104 580 

CTTGAACCTT ATGAAAAATT AGAAGCACCA GTGGTGTCCG CAACAACAGC TCAAGATAAA 104640 

GAACATGCGA TCAAATTAAT TGAAGAAGCT GGATTGAAAT AATCCAAATA TTGCGTGCGT 104 7 00 

GTATTTGAAA AAATTCACGC ACGCGTTTAC AATTCTCTAA CAAAATTAAC CGCTTGCGTG 10476 0 

CGTGAACTGA AAAGTTCACA TTCCTAAAAG GATTTAACTT TGCCTCGCAG ACCGCCATTC 104820 

TGGCTCACTT TACTTATCAT CTTAATCGGG CTTC CGTTAT GTTTGCCGTT TCTGTATGTC 10488 0 

ATTTTGCGTG CGACAGAAGT GGGATTAACG CGAAGTGTTG AG C TATTGTT TCGCCCTCGA 104940 

ATGGCTGAAT TATTAAGCAA TACAATGCTT TTAATGGTTT GTGTAACTAT CGGTGCTATT 105000 

TCACTTGG C A CGTTTTGTGC TTTTT t ACTT GAACGTTATC GCTTTTyCGG TAAAGCTTTT 105060 

TTTGAAGTGG CAATGACGTT ACCCCTTTGT ATTCCCGCCT TTGTAAGTgG TTTTACTTGG 10 512 0 

nTCAGCCTCA CATTCCGCGT TGAGGTTTTT TGGGGAACAA TTGGTATTAT GACATTAAGT 10 518 0 

TCATTTCCTT TGGCTTATTT GCCCGTTTCC GCCATACTAA AAAGACTTGA TCGTTCTCTT 10 524 0 
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GAAGAAGTAA GCCTTTCGCt TGGTAAAAGC CCTGTGTATA CCTTTTGGTA TGCTATTTCC 1053 00 
CCGCAACTTA AACCCGCCAT TGGTAGTAGT ATCTTACTGA TTGCTTTACA TATGTTGGTC 105360 
GAATTTGGTG CAGTGTCAAT TTTAAATTAT C AAAC CTTTA CCACTGCCAT TTTCCAAGAA 105420 
TATGAAATGT CTTTTAACAA CAGCACTGCT GCATTATTAT CCGCCGTTTT AATGGCGATT 105480 
TGTATACTTA TTGTTTTTGG GGAAATTTTC TTTCGTGGAA AACAAACCCT TTACCACAGT 10554 0 
GGAAAAGGCG TTACTCGTCC TTATCTCGTA AAAACACTTT C CTTTGG AAA ACAATGTTTA 105600 
AC CTTCGGAT TTTTCTCTAG CATATTCATT TTAAGCATTG GCGTGCCTGT GATTATGTTA 105660 
ATTTACTGGC TTATTGTGGG AACTTCACTT GAAAGTGCGG GTGATTTTTC AGAATTTTTA 105720 
TCGGCATTCA GCAATTCATT TATCATTTCA GGCTTAGGCG CATTGCTTAC CGTAATGTGC 105780 
GCCTTGCCAT TAGTTTGGGC AGCTGTTCGT TACCGTAGTT ATTTAACGAT TTGGATCGAC 105840 
CGCTTGCCAT ATCTATTACA CGCTGTGCCA GGCTTAGTCA TTGCCTTATC ATTAGTCTAT 105 900 
TTTTCCATCC ATTACGCTAA CGACCTATAT CAAACCTTTT TTGTGATTAT CATTGCCTAC 105960 
TTTATGCTCT ATTTGCCAAT GGCACAAACT ACATTAAGAG CGTCTTTAGA GCAACTTTCC 105 02 0 
GATCAAATTG AAAAAGTCGG ACAAAGTCTT GGGCG AAAC C CTTTCTATAT TTTTCGCACC 106080 
TTGACGCTAC CAGCCATATT ACCAGGTGTG GCAGCCGCAT TTGCTTTGGT TTTTTTGAAT 10614 0 
TTAATGAAAG AGCTTACCGC GACGCTTTTG CTCACATCAA ATGATATTAA AACCTTATCC 106200 
ATTGCTGTGT GGGAACATAC CAGCGATGCA CAATATGCTG CTGCCACCCC TTATGCGTTA 106260 
ATGCTCGTtC TTTTTTCGGG TATTCCTGTA TTTTTATTGA AAAAATATGC GTTTAAATAA 106320 
AATGATAAAT AATCCGTTAT TAACCGTTAA AAATCTCAAT AAATTTTTTA ATGAACAACA 1063 80 
AGTTCTGCAC GATATTTCAT TCAGCTTACA ACGCGGAGAA ATCCTCTTTT TACTTGGTTC 10644 0 
TTCAGGATGT GGCAAAACTA CATTATTACG TGCCATTGCA GGTTTTGAAC AACCCTCTAA 106500 
TGGCGAAATT TGGCTAAAAG AGCGGTTAAT TTTTGGCGAG AATTTTAATC TTCCAACGCA 106560 
ACAACGTCAT CTCGGTTATG TGGTGCAAGA GGGCGTACTT TTTCCTCACT TAAATGTCTA 106620 
TCG CAACATT GCTTACGGAT TAGGCAACGG CAAAGGAAAA AATAGCGAAG AAAAAACGCG 1066 80 
GATTGAACAA ATAATGCAAC TAACGGGTAT TTTTGAACTG GCGGATCGCT TTCCACATCA 106740 
ACTTTCAGGC GGACAACAAC AACGCGTAGC ATTGGCGCGT GCTTTAGCCC CTAATCCAGA 106800 
ACTGATTTTA TTAGACGAGC CTTTCAGTGC CTTAGACGAG CATCTTCGCC AACAAATTCG 106860 
CCAAGAGATG CTCCAAGCAC TTCGCCAAAG CGGTGCTTCC GCAATTTTCG TTACTCACGA 106 920 
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CCGTGATGAA TCCTTACGCT ACGCTGATAA AATCGCCATT ATTCAGCAAG GTAAAATTTT 106980 
ACAAATCGAT ACACCTCGCA CCCTTTATTG GTCGCCTAAT CATCTTGAAA CAGCAAAATT 107040 
TATGGGGGAA AGTATTGTTT TGCCTGCAAA TCTACTCGAT GAAAATACCG CTCAATGCCA 107100 
ATTAGGCAAT ATTCCTATAA AAAATAAATC AATCTCACAA AATCAAGGTA GGATTTTGCT 107160 
TCGTCCAGAA CAATTTAGCC TGTTTAAAAC ATCAGAAAAT CCAACCGCAC TTTTTAATGG 107220 
ACAAATAAAA CAAATTGAAT TTAAAGGnAA AATTACCTCC ATTCAAATTG AAATTAATGG 107280 
CTACGCAATA TGGATTGAGA ACGTTATTTC TCCCGATTTA TCTATTGGCG AT AATTTG C C 107340 
TGTTTATTTA CATAGAAAAG GGCTTTTTTA CTCCTAAGAA TAAAGCCTTA TCTTCACGAT 107400 
AAGGCTTATT TTTAAACAAT ATATTCGGAA AAAATTTGTT CAAGATGTGA TATTAAACAG 107460 
TCTGCGCCTA AAATATTTTC CCCTTTCCAC GCCTGTGGCA AAATCTCTGG AGAAAATTGC 107520 
TGGCTTGCCA ATTCAGCTAA CGGCAAATAA CTGATATGCC AAGGCTCCCG CCCCACTTTC 107580 
TTATTGGACT GCATATTCAT AAAAGGCAAA GCAAAATCAA AGTGCGGTAA ATTTTCGGCA 107640 
AGAAATTCAC TCAATTCAAA GAAGTAGCCG CCTTTTTCAT ATTCCCAAGG CTCCAGTTGT 107700 
AAAGATTGAC CTCGTGGCAA AAGATCAGGA TCAAAAATAT CCACTTCCGT TCCCCAATGA 107760 
TGACGACTAG CCGCAGTAAT GCAGACCAAC GTAAAATCGC CTGACATTTT TGCCAGTCAT 107820 
CTAATTGATT CAAATCTAAT GCCTTTCCTG CATCATCGTG TACTTTTCGC TCGCCCTTAA 107880 
ATTTACTGTT CCAAATAAGT TGTTGGCGTT CAAAATCACG AAAACTGCTC GCAGGCTGTA 107940 
AATTAAAGCC ATTTTTTGCC GCACTTTGTT GCAAAGCTTG AAATGCTTGC ACAGCTTGCG 108000 
TTTGCAAAAA ATGATTTGAT GAATGAGTTG TGGGTAAATT G ACT AAATG C TCACGGGACT 108060 
TTCCTGTAAG CATTTCTGGG GTTAATTTCA TTTTAGCTAT CCAATAAATT CACTAACATT 108120 
TTGTGGTAAA TTTCACCGCA CTTGCCAAGG TCTTCTACAC TGACACATTC ATTGACTTTA 108180 
TGAATGGTTG AATTTAACGG GCCAAATTCC ACTACTTCTG CGCCCATCAA TGCAATAAAA 108240 
CGACCGTCTG ACGTGCCGCC ACCTGTTTCA GCCTTTGGCG TTATGCCTAT GGTTTCCTCA 108300 
ATAGCGGAGG TTATCGAATC TAATAATTTA CCTGGTTTTG TTAAAAAGGG TTTGCCTGAT 108360 
AAATTC CATT CAATACGATA TTTCAAATTG TGTTTTTCTA GCATTTCAGC CACTTTTTGC 108420 
TTAATGATTT CATCCGTGAC TTCTGTGCAA TAACGCAAAT TGAACTGAAT ATATAACTCA 108480 
GCAGGAATTA CATTATTACT TCCCGTACCC GCGTGAATGT TAGCAATTTG TAGGCTTGTT 108540 
GGCGGGAAAA ATTCATTGCC TTTATCCCAT TGGTAAGTCG TTAATTCTTG CAAAAATAGC 108600 

-77.66- 

SUBSTITUTE SHEET (RULE 26) 



BNSOOCIO <WO 9633276A1J_^ 



WO 96/33276 PCT/US96/05320 

GCTGCTTTAT GGATTGGGTT TTCAGCTAAA TGCGGATAGG CTACGTGTCC TTGAATACCT 10 8660 

TGAATATAGA GATTTCCTGT AATTGAGCCG CGGCGAC CAT TTTTGACAAC ATCTCCTAAG 108 72 0 

TTTTT AG CAC TCGATGGCTC GCCAACCATA CAATAAGTAA TTTTTTCATC ACGCGCCATT 10878 0 

AACGTCTCAA CCACGTGTAT AGTTCCATCT TTTGCGGTAG CCTCTTCATC AGAGGTAATC 10884 0 

AATAATGCAA TGGTGCCTTT ATGATTAGGA TTTGCTTTTA CGTATTCTTC TGCCGCCACA 10890 0 

ATCATTGCTG CAAGGGAGCC TTTCATATCC GCCGCACCAC GCCCATAAAG CATGCCATCA 10 8960 

ATAATCTCCG CAGAAAACGG CGGAGACGAC CATTGATTTT CAT CG CCAGT AGGCACAACA 10 9020 

TCCGTATGAC CTGCAAAAGC AATAACTGGC TCGCTCGTTC CGTGTTTTGC CCACAAATTC 10908 0 

AACGTGTCAT TAAAAGGCAT CCATTCAATT TGAAAACCTA ATTTTTCTAA ACGTTCGGCA 10914 0 

ATAATTTGCT GACAGCCTTC GTCATTTGGG CTAATAGACG GGCGACGAAT TAAATCTTGT 10 9200 

GC CAACG AAA CCACTTTTTC TTTCATAAAA ACTCCTAAGA AAATAACCGC ACTTTAAGCA 109260 

AAAGCAGCCT GATATTCCTT TTCATTAAAT CCAATTAATG CCTTTCCATC TTGTAAAATA 109320 

ATTGGGCGTT TAATTAAAGT TGGATTCTCA GC CAACACGG AAAGTGCGGT GGTTTTATCA 10 9380 

AGAGAATTTT TAACTTGCTC AT CCAAATTA CGCCAAGTTG TACTGCGTTT ATTCACTAAC 109440 

ACATCCCAGC CAAATTGAGT TTCCGCTTGG GTTAAAAAAT TTAAATCTAA CCCATCAACA 10 9500 

CGGTAGTCGT GAAGTTTGTG TTCGATATTG TGATCCGCCA GCCATTTCAA CGCTTTCTTT 10 9560 

ACTGTATCGC AATTTTTAAT GCCGTAAACT GTAATCATAT TTCACCTAAA AAAGAAAAAT 10 9620 

C CTCAATCT A ATCAGACTGA GGATTTTAAC CCTTATGCTA CTTCATCTCA AATGAATTAA 10 9680 

CCTAACAACT TATTGATACG TTTAATAAAC GCAGCTGGAT TTTCCAAAGA TCCGCGCTCT 10 9740 

GCTAACATAG CTTGTTCAAG TAATAATTCC AC C CAATCAG CAAATTCAGT TTCATCTGCA 10 9800 

ATATCAG C C A CTTTTTTCAC TAAATGGTGT TCTGGATTCA GTTCAAATGT GTATTTTACT 109860 

TCTGGTACAG GTTGCCCTGC GGCTGCAAAC AATTTCGCCA TTTGGGTCGT CATTTGATCG 10 992 0 

TTATCCGTAG AAACCACCGC TGGTGTATCC GTTAAATTGT GAGTTAAACG CACCGTTTTC 10 9980 

ACACGTTCGC CAAGCAAGTT TTTCACACGC TCAATAAAAC TACCAAAGGC TTCATCTTGT 11004 0 

TGTTTTTGTG TTTCAGATTC TTTATCCGCT AAATCGCCTA AATCCAAATC TGCTTTTGTA 11010 0 

ATACTTTGTA ATTGTTTACC ATCGAATTCA GTTAAATAGC TTAACATCCA TTCATCAATG 11016 0 

CGATCGGAAA GCAACAAAAC CTCAATGCCT TTTTT ATTGA ATAATTCCAA GTGCGGGCTA 1102 20 

TTTTTCGCTG CAACATAACT ATCCGCTGTA ATGTAATAGA TAGCTTTTTG CCCCTCTTTC 110280 
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ATACGCAAGA TGTAATCTTC TAAAGAGACA GTTTGCTCGC TACCATCATT ATGTGTCGAA 110 34 0 

GCAAAACGCA ATAACTTCGC GACAGTTTCT TTGTTAGCAA AATCTTCTGC AGGGCCTTCT 1104 00 

TTTAAGACTA AAC CAAACTC TTTCCAGAAT TGAAGATATT TTTCTGCATC ATCTTTTGCT 110460 

AATTTTTCTA GCATTTGTAA TGAACGCTTA GTTAATGCTT TGCGTAGTGC AGCCGTAATT 11052 0 

TTGTTATCCT GTAAAATTTC ACGAGATACA TTTAATGGCA AGTCATTGCT ATCGATTAAA 110 58 0 

CCACGCATAA AACGTAGATA ATTCGGTATA AATTGCTCAG CATCATCCAT AATAAATACA 110640 

CGTTGAACAT AAAGTTTTAA GCCATGTTTA TGTTCTCGAT TAAATAAATC CCAAGGTGCT 110 7 00 

TT AG CTGGT A CATAAAGCAA ACTAGTATAT GCTTGATTTC CCTCAACTTT GTTATGCGCC 11076 0 

CAAGTCACAG GATCAACAAA ATCATGGCTT AAGTGTTTGT AAAACGCTTT GTATTCCTCA 110820 

TCAGACACAT CATTTTTAGA ACGCGTCCAA AGGGCATCAG ATTTATTGAT TTTTTCCCAT 11088 0 

TTTTCGCCGC ACTCTTTGCC TTCATCATCG TATTCTTTCG TCAGCATTTC CACTGGCAAG 110 94 0 

CCAATATGGT CAGAATATTT ACCAATAATT TCACGCAAAC GCCATTCATT TAAAAATTCT 111000 

TTTTCATCTT CACGTAAATG CAAAATCACA TCTGTACCAC GGGATTTTTT CTCAATATCC 11106 0 

GCCACAGAAT ATTCGCCTTC ACCCGCAGAT TCCCAAAGTA CGGCTTTATC CGCCTCTTCA 11112 0 

CCAGCTGCTC TGGTTTTTAC AGTCACTTTA TCTGCCACAA TAAAAGCAGA ATAAAAACCC 11118 0 

ACACCAAACT GCCCAATAAG CTGGCTATTT TTTGCTTGAT CTTGAC CAAG TGCGGTTAAA 11124 0 

AATTCTTTTG TTCCTGATTT TGCAATCGTA C C CAAATG AT CGATGACTTG CTCACGGGTC 11130 0 

ATGCCAATGC CGTTATCGCT AATTGTGATA GTGCCTTTAT CCGCATCAAA GCTAACACGC 11136 0 

ACACGCAAGT CGCCATCGCC TTCATATAAA GCAGGGTTGG AAAGTGCTTT AAAGCGTAAT 1114 2 0 

TTATCTGCCG CATCAGAAGC ATTAGAAATC AATTCACGTA AGAAAATTTC TTTGTTAGAG 1114 8 0 

TACAAAGAAT GAATCATTAA TTGGAGTAGT TGTTTGACTT CAGATTGGAA TCCGCGAGTT 11154 0 

TCTTGATTTT GTGACATAAT TTTTTCCTTC ATAAAGATAA AAATTGAACG TCACAGATAT 1116 0 0 

AAGAATAAAA AGAAAGATTT CAAGAGGAAT TTAAGAGGAA AAGTGCGGTA AAAAAACACA 1116 6 0 

AGGAATTTAA ACCGCACTTT AAATGAAATA AAAAATTTCT TATCCCCCTT GATCG CTATA 11172 0 

AATTTATGTA TATAATGAAT TCTGTTTTAG ATACTAGATA GTAATTTAAT AGTAATGAAA 1117 8 0 

CTTGGCTAGT GCATTCCCCC ACTCCTTTCG CACTAGCCAT TTTTTTGGGA ATTTTGTACC 1118 4 0 

ATATCCCAAC TTTTTCCAAT CAACGATCCA TTATGAATAA TCTTGAACTT GAACAACTCA 111900 

TTAATCAAAA ACTTTCTTCT GATAAAATCA ATG ACT ATG C GCCCAACGGC TTACAAGTCG 11196 0 
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AAGGTAAAAC AGAAATCAAA AAAATCATTA CTGGCGTAAC GGCGAGTCAG GCTTTAATCA 112020 
ATTACGCAAT CAGCCAAAAT GCCGATGCTA TTTTAGTTCA TCATGGCTAT TTTTGGAAAA 112080 
GTGAAACCCC TTGTATCCGA GGAATGAAAG GCAAGCGAAT TAAAGCCCTT TTAGTGAACG 11 2140 
ACATTAATTT AT ATGG CTAT CATTTGCCAT TAGATGTGCA CCCAGAATTA GGTAACAATG 112200 
CACAATTAGC CAAACTGCTC GATATTGAAA ATTTACAACC ATTAGAAAAA GGCTCTGTGA 1X2260 
GTATTCCTGT TTGGGGCGAA TTAAAAGAAC CGATGACAGG TAAAGACTTT GCAGAAAAAA 1.2320 
TAGAAAAAGT GTTAAATCGA AAACCGTTAA TTTGCATTGA AAACGGACCG CACTTAATCC 112380 
GAAAAATTGG CATCTGTACC GGTGGCGGAC AAGGTTATAT TGATCTTGCT GCAGAACAAG 112440 
GCTGCGACGC ATTTATTACA GGGGAAGTGT CAGAACAAAC GATTCATTCT GCCCGTGAGC 112S0O 
AAGGATTGTA TTTCTTCTCT GCAGGTCATC ACGCAACAGA ACGCTATGGC ATCAAGGCAT 112560 
TAGGCGAATG GTTGG CG AAA GAATATGGCT TTGACGTAGA ATTTAAAGAT ATAGATAATC 112620 
CAGCATAAAA TACAAAAGTG CGGTAAAAAT ATTTCTTTTT TAACCGCACT TTTATCTTCT 112680 
GAAATTATCG TTTAAACATT CCGCCTAAAC CGCCAAGACC GCCTAAGCCT CCGCCCATTA 112740 
AGCCTTGCAT TCCACGCATC ATTTTAGCCA TGCCGCCTTT ACG CATTTT C TTCATCATAC 112800 
GTTGCATTTC ATCGAATTGT TTAAGTAATT TATTGACATC TTGCACTTGA GTGCCAGAGC 112860 
CTAATGCAAT ACGACGACGG CGAGAtCCTT TGATAATATC TGGGTTGGCA CGTTCTTTTA 112920 
GGGTCATGGA GTTAATGATC GCTTCCATTT TGACAAACAT TTTGTCATCT ACTTGATTTT 112980 
TAACGTGCTC AGACAAATTT TTTGCACCTG GTAATTTCTC AAG CATAG AC ATCATGCCGC 113040 
CCATTTTTTT CATTTCGATC AGCTGTTCGC GGAAATCATC TAAAGTAAAA TCATCGCCTT 113100 
TCTTGAATTT CTGCGCCATT TTTTCCGCTT TTTCACGATC AACAGAACGT TCAAGATCTT 113160 
CGATAAGGGA AAGCACATCG CCCATGCCCA AAATACGGGA AGCGAcACGA TCAGGATGGA 113220 
ATGGCTCAAG CGCCTCTGTT TTCTCGCCCA CACCCAAGAA TTTGATTGGT TTACCTGTGA 113280 
TTTGACGAAT CGATAACGCC GCACCACCGC GCGCATCACC GTCCACTTTC GTCAAAATAA 113340 
CCCCTGTAAG AGGCAATGCT TCATTAAAGG CTTTTGCTGT ATTTGCCGCA TCTTGACCAG 113400 
TCATCGCATC AACAGTGAAA AGAGTTTCGA TTGGATTTAA TGCAGCATGG ACTTG CTTGA 113460 
TTTCGTCCAT CATTTCTGTA TCAACGTGTA GGCGACCTGC CGTATCCACA ATCAACACAT 113S20 
CGTAGAATTT CAGTTTTGCA TCAGCAAGCG CCGATTTAGC AATATCAACG GGGTTTTGTT 113580 
TAACATCCGA TGGAAAAAAA TCCACGCCAA CGGATTGAGC CAAGGTTTCA AGTTGCTTAA 113640 
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TCGCAGCAGG GCGATATACG TCAGCAGACA CCACTAACAC TTTCTTTTTA TGGCGTTCAC 113 7 00 

GCAAGAATTT TGCCAATTTA CCCACACTGG TGGTTTTACC CGCCCCTTGT AAACCCGCCA 11376 0 

TTAAGATAAC TGCTGGTGGT TGGGTTGCGA GATTTAAACT CTCATTCGCT TCGCCCATGG 11382 0 

CTTTTTCAAG CTCACGCTGA ACGATTTTTA AGAATTCTTG CCCTGGCGTT AAACTTTTAT 11388 0 

TGACTTCTTC CCCAAGCGCG CTTTCTTTTA CTTTTGCGAT AAATTCACGC ACCACAGGCA 113 94 0 

AGGCAACATC GGCTTCAAGT AATG C CAT AC GCACTTCGCG TAAGGTTTCT TTAATATTAT 114 00 0 

CTTCCGTTAA ACGGCCTTTT CCTGTGATAT TACGTAAGGT TTTGGAAAGG CGATCCGATA 114 06 0 

AATTCTCAAA CATGGCTAAC AATCCTGTTT TTCTTAAAAA ATTGTGCTGA TTATACCGAA 11412 0 

ATTTAGG CGT TTTTCATCAT TTCAAATGAA AATGACTTTT TGAGTAGGCG ATTTCATAAA 11418 0 

GAAAGTATAG GACATAAGGA AAAAGAATGA TTATTTAGCC AATTTCAGGT ATTATTCTGC 114 24 0 

CCACTCTTAG CTATTTGAGC AGCTATTTGA TTTAGCAAAA TCAAATAACA GGTAAAAAGT 114 3 00 

GCGGTGAAAA ATGACCGCAC TTTTTGTTGA TTAACGAACA AAAGGATTTT C CCTTGGAT A 114360 

GCATTCCCCT TAGTACTTTA TTTATT ATT C TTATTATCTG TTTAGTGTTA TCAGCCTATT 114 42 0 

TTtCTGGTTC TGAAACTGGA TTACTCTCCC TCAATAAATA TCGTCTTCGT TTTCTATCTG 11448 0 

AACAAGGCAA TAAAGGCGCG AAAAAAGCGG AAAAACTGCT AG AAAAG CCA GATACCTTGT 114540 

TAAGTTTTAT TCTTATTTTT AATAACCTTG TGAATATCAG CGCATCTGCA ATTGCAACAG 1146 0 0 

TAATTGGTAT GCGTTTATAT GGCGATGCCG GCGTGGCGAT TGCAACGGGT TTGCTCACTT 1146 6 0 

TCGTAATGCT CGTATTTTCT GAAATATTCC CCAAAACTGT GGCTGCAATG CACGCAGAAA 114 72 0 

AAGTGAGTTT TTTTTCAAGT CATATTTTAA CTTCATTGCT TAAAATCTTT TATCCGCTCG 11478 0 

TTTGGTTAAT GAATATTTTT ACTAAATCCT TGATGCAAAT AGTAGGGTTA AAACTTGATA 114 84 0 

TGCAAAAACA AGTAATTAGT AGTGAAGAAC TGCGTAGTAT TGTATCAGAa GCGGGCGAAG 114 900 

C AAC AC C AAA TGAACAACAT CCACAAATGT TGCTGTCTAT TTTGGATATG GAAACCGTCA 114 96 0 

CCGTTGATGA CATTATGGTT CCCCGCAACG AAATCGGCGG TATAAATATT GATGATGATT 11502 0 

GGCGAGCCAT TATGCGCCAC TCAATCATGC TG C AC AT AAC CGCGTGGTGC TTTACAAAGG 11508 0 

CAGCCTTGAT GAACAAGTGC TTGGCATTTT GCGCGTGCGT GAAGCATTCC GCTTGCTCCT 11514 0 

TGAAAAAAAT GAATTTACCA AAGAAACCTT AATTCGTGCC GCAGATGAAG TGTATTTCAT 1152 00 

TCCTGAAAGC ACGCCATTAA AAACGCAACT TGCAAATTTC CGTACAAATA AAGAGCGAAT 115260 

TGGCTTAGTT GTAGATGAAT ACGGCGATAT TAAAGGGCTT GTGACGTTGG AAGATATTTT 1153 2 0 
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ATGAAGTAAG AATTTTTGTA AATTCATGAT ACGCAGAATC TGTCGCCTTA CGCTCAATTT 117060 
TTACCATATC AACGGC AT C A GCATCACCAC TAACTAAATG GGG CGTGG AT TCTAATACAA 11712 0 
AGTGCGGTCA ATTTTnCTTT ATTTTTATAC GTTGGCTTTT AAACTGGTTT ATTTGAAGAA 117180 
AGCACGCTAG AATAAGGTTG TTTTCATTTC AACGAAAAAA GTAAGGAAAT TCTATGATCG 11724 0 
ATCCAAATTT ACTCCGTAAT AATTT AG CAG AAGTCGCAGA AAAATTAAAA GTAAAACGCA 117300 
ACTTTATGCT TGATACGGAA AAACTCACCG CGTTAGAAGA TCAGCGCAAA AACTTACAAG 117360 
TCACTACCGA AAACTTACAA GCGGAACGTA ATGCTCGTTC CAAAGC C ATT GGTGCTGCTA 117420 

AAGCGCGTGG TGAAGATATC GCACCATTAT TGG CAGAAAT GGATGATATG GGCAATCAAT 117480 

TGACTGAAGC AAAAGCACAA TTAGATGCGG TACTTGCAGA AATCAATCAA ATTGCATTAA 117540 

GTATTCCAAA TCTACCCGCA GACGAAGTGC CATTAGGCAA AGATGACACA GAAAATAAAG 117600 

AAATCTTACG TTGGGGTACA CCGCGTACAT TTGATTTTGA AGTGAAAGAT CATATCACTT 117660 

TGGGTGAAGA AGCAAATGGC TTAGATTTTG CTGCTGGGGC TAAATTAGCT GGTGCACGTT 117720 

TTGC CGTAAT GAAAGGTCAA ATCGCTAAAA TGCACCGTGC ATTGGCACAA TTCATGTTAG 117780 

ATCTTCATAC GGAACAACAC GGCTATTTAG AAACCTATGT GCCTTATTTA GTAAATCACG 117840 

CCACACTTTA TGGAACAGGG CAATTACCAA AATTCGGGGA AGATTTATTC CATACTTTAG 117 900 

CCTTAGAAGG CGAACAACCT TACGCATTAA TTC CAACTGC AGAAGTGCCC GTCACTAATT 117960 

TAGTGCGTGA CGTGATTATT GATGAAGCAG AATT AC CAAT AAAAATG AC C GCACATACGC 118020 

CATGTTTCCG TTCTGAAGCA GGTTCTTACG GTCGTGATAC TCGTGGTTTA ATTCGTATGC 118080 

ACCAATTTGA TAAAGTAGAA ATGGTACAAA TCGTTGATCC AGATAAATCA ATGGAAGCAC 11814 0 

TTGAAGAATT AACAGGTCAT G CGG AAAAAG TATTGCAATT ATTAAATTTA CCATACCGTA 1182 00 

AAGTTTTACT TTGCACAGGC GATATGGGCT TTGGTTCTTG CAAAACTTAC GATTTAGAAG 11826 0 

TTTGGGTGCC TGCACAAAAT ACTTATCGAG AAATTTCTTC TTGCTCAAAT ATGTGGGATT 118320 

TCCAAGCTCG CCGTATGCAA GCACGTTGTA AAG CAAAAGG CGATAAGAAA ACTCGCTTAG 1183 80 

TG C AC AC ATT AAATGGTTCA GGCTTAGCGG TTGGTCGTAC TTTAGTGGCT GTACTTGAAA 118440 

ACTATCAAAA TGCCGATGGT TCAATTACCG TGCCAGAAGA GCTACGTCCA TATATGGGCG 118500 

GATTAGACGT TATCGGAAAA TAATCCTTCT AGAAGATAAA AAAGCGTGGA TTTTAAAATT 118560 

TCCACGCTTT TTTATTATCA AAGCCTTACA ATAACTGTTT TAAATTTTTT GGTATTCAAG 118620 

CCAATTAGGA ACTTTATTCT ATTATAAAAT TATATCTATA ACATAGAAAA TTTATCTTTC 118680 
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AATTGACATC AATATCCAGA ACATTACTCT AAAAAACATA TCAAAGTATC TAAATTGTTA 118 74 0 

TAATATTAGT T CACTTT AC G ATTAAACAAT AGTAAGAAGC AATTTTAATA AGTTAGATAT 1188 00 

TCCACGTATA AAACAGTATA ATTCCATCAA TAGCTCATAA ACCAACAGGA GTATTTTATG 118 86 0 

AAATTATATG GTTTAATTGG TGCGTGTTCA TTTGTTCCAC ATGTGGCATT AGAGTGGGTA 11892 0 

AAAATACGTG AAAATGCTGA TTATGAATTT GAACCTGTCA CTCGTGAGTT AATTAAATCA 118 980 

C CTGAATTTT TATCATTGAA TCCACGCGGG GCAGTACCTG TATTAGTTGA TGGCGATTTA 11904 0 

GTTCTTTCGC AGAACCAAGC GATTTTGCAC TATTTAGATG AACTTTATCC TAATTCAAAA 119100 

TTATTTGGTA GCAAAACCGT ACGTGATAAA GCCAAAGCAG CTCGCTGGTT AGCATTTTTT 119160 

AATTCCGATG TGCATAAATC ATTTGTGCCT CTGTTCCGTT TGCCAAACTA TGCAAAAGAT 11922 0 

AACGAAACAC TTGCACACAC TATTCGCCAA CAAGCAGTAG AG C AAATTTT AGATCAACTT 119280 

GCAGTCGCAA ATGAGCATTT AGAAAGTCAC ATTTATTTTG GAGAAAATAT TTCCGTTGCA 119340 

GATGCCTATC TTTACATTAT GCTAAACTGG TG CAAAGCAG TCAAAATTGA TTTCAGCCAC 119400 

TTAACACAAC TTTCGGCTTT TATGCAACGT GTTGAGACTG ATCAAGCTGT CGAAAATGTG 119460 

AGAAAAAGTG AAGAACTAAA AGTATAAACA GGAAGTTTCT ACAAAAATTG CCCGTTTTTA 11952 0 

ACCGTTTGTA TCACGTTAAG TTTAAATTTA AGTGAGTAAT TTCGCGTCTT ACCCAACACA 119580 

GCTAACCCAT TGATTCTACT ATGATTAAAC TGGTTAATCC AATGTCGCAA TGtTCTACTG 119640 

TCAAGTTGAA AATGTTTACT GATGAGTGAA TAATTTTTAC CATTTTGAAT ATAAAATTCG 119700 

ATGACTTGTT GTTTGAAAAG AGTGTTGTAT TTGGCTATAA AAAATCTGCn TCTTACAATT 119760 

AATTTATATA AGTAAAAATA TCTTGCTTTT ATCATCGTTA AACTTAGAAT AAATATAAAT 119820 

AATTATCATT AAGATTTAAG GGTATATATG AATATTTTAA TTAATAAACG TATCTTTTTA 119880 

TTAGTTACCC TTGTAGGTAT TCAACTAAAT GTAACCGCAA AACnAAATTC TTCTAATTCT 11994 0 

AATAGAGAGG AGTTATTACC TATTATTGTG AACACTAATG ATGATAGTAA TAAACTACCT 1200 00 

GGTCGTTCAG TATTAAAACA GAAAAATATC GAACAACnAC AAGCGGATAA TGCAGCAAAT 12 006 0 

TTAATTAATA TCTTGCCAGG TGTGAATATG GCTGGCGGGT TTCGCCCTGG TGGGCAAACA 120120 

TTAAATATTA ATGGAATGGG TGATGCTGAA GATGTTAGAG TGCAGTTAGA TGGGGCTACA 12 018 0 

AAAAGTTTTG AAAAATACCA ACAGGGTTCT ATTTTTATTG AGCCTGAATT ATTAAGAAGA 12 024 0 

GTAACAGTAG ACAAAGGAAA TTATTCTCCT CAATATGGTA ATGGTGGTTT TGCTGGAACA 12 0300 

GTAAAATTTG AAACTAAGGA TGCTAGGGAT TTCTTGCAAG AAAACCAGAA AATAGGTGGT 12 036 0 
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TTTTTAAAGT ATGGAAATAA TAGCAATAAT AACCAAAAAA CTTATAGTAC TGCACTAGTT 12 04 20 
TTACAAAATG AACAAAAAAA TATTGATTTA TTACTATTTG GTTCTGTAAG AAATGCTGGC 12 04 80 
GATTATAAAA GACCCGATAA TAGCAAAATT CTTTTTTCAA AAAATAATCA AAAAACCGGA 120540 
TTAATAAAAT TAAATTGGCA AATTTCACCC GAGCATTTAC TCACATTATC GAGTGTTTAT 120600 
GGTATTCATA AAGGCTGGGA GCCTTTCGCA GCGAAAAGAG ATATCCTACC TAAACCAAGT 120660 
TTGAGTGATA TAATGCGTTA TGGTACTGAT ATAGCGTGGA AACGTAAACT GGTTTATCGA 12 072 0 
GATCAAAAAG ATGAAAATTA CACATTAAAA TACAACTATC TACCTGAAAA TAACCCATGG 12 078 0 
ATTAACTTAT CTACTCAGTT CAGCTACAGT AAAACCACAC AAAATGATAT GCGTCCCAAA 120840 
GAAGCATCTT CAGGATTAGT GGGTAGCTTA GGTAATCAAA GCTGGATAAC TTATTCAGAT 120 900 
CTTACTTTTG ATATAAATAA CACAAGTACT TTTAATATCA AAACTACTGT TCATGAACTA 12 0 96 0 
TTATTTGGTT TGCAATGGTT AAAAAACACA AGAAATACTT TAATGTATGA TAAAAGTAAA 12102 0 
GTGAGAAAAG CGGATTATAA TTATGGCTAT TTCCAACCCT ATTATATGCC ATCTGGACGT 12108 0 
CAATATACAC AGGCGTTTTA TTTACAAGAT CAAATAAAAT GGAAGAATAT AATTTTTAGC 12114 0 
ACAGGAGTGC GTTATGACCA TATCAATAAT ATAGGGCAGA AAAATTTAGC ACTAAAATAT 121200 
AATGATATCT CTGCAGGACA TGATTATAGC CAGAAAAATT ATAATGGTTG GTCTTATTAT 121260 
TTAGGTTTAA ATTATGATGT AAACCATTAT TTAAGTTTAT TTACGAATTT TAGTAAAACT 12132 0 
TGGCGAG CAC CTGTTATTGA TGAnCAGTAT GAGACGCAAT TTAAGCAATC TTCTGTGCCT 121380 
GCAACCTCTT TAAATTTAGA AAAAGAAATG ATTAATCAAA CGAGAGTGGG TGGAATTATT 12144 0 
ACTCTCAATC ATCTATTTCA GGAAAATGAT GCTTTTCAAT TTAGAACTAC TTATTTTTAC 121500 
AATCGCGGCA AGAATGAAAT ATTTAAAACC AGAGGAGTTA ATTGTGTAGG CAATGCTGCA 121560 
GATACCAATA ATAAGGTTTG TCCTAAAATC ATTGAAAATT ATCGTAATTT GCCAGGTTAT 12162 0 
GTTATTCAAG GTGCAGAGTT AGAAGCTTAT TATCAATCTA CTTATTTATT TGGTGAGATA 12168 0 
ACATATTCTT ATGTAAAAGG AAAACGTGAC ACCTCACCAA GAAACCCATG GGGT AAAAC C 121740 
TCAACATGGA TTGCTGAAAT TCCCCCTAGA AAAGCAACTA CTGCTTTAGG TTTCAATGTT 121800 
CCAAAATATT ATCTCACAGT AGGTTGGCGT GCTGAGTTTG TAAGAAGACA AGATCGATCG 12186 0 
CCATTATCTG GTGATCCTAA AGCATCATCT TGGTCATTGC CAGCTTCAAG AGGATACAGC 121920 
CTACATAACC TATTCCTTTC TTGGAGCCCT GCAAAAATTA AAGGTATGAA TGTTAAGATT 12198 0 
ACAGTGGATA ATTTATTCAA TCGAGCATAC AACCCTTATC TAGGCGAATT GGCTTCTGGA 122 04 0 
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ACAGGTAGGA ATATCAAATT TAGCCTATCT CAGAAATTTT AGAATAAATA GCCCGATTGA 12 2100 

ATCTAATCGG GCTTTATCAT AACTATAATT AGATCATAAA AAAATGTAGA GATAATAAAT 122160 

ATAATTnCCT ATACTAATTT TAGTATTCTT TAAAG AG CAA ATTTAAATAA T AAAT C C AAA 122220 

TAAGTACGAC ATCTTAAATT ATAGATAAAT ATCAAAATTA TGTTTTATCT TTACCCTATT 1222 8 0 

ACTTTCCATA ATAGAAACTA AACTTACCTA AAAATTTAAA TATAATTTAG CATCTnGCTT 12234 0 

AACAACAAAG T C AAT AC AC A TTTCTTCTAG TTTTC CTTT C TTTCCGATTA ATATACCCCA 1224 00 

AACTATATAG CCTGTTGTTC TCACTAACTT TCCTACTATA GGAATATCAT TAGCATAAGC 122460 

TATGAAGAAG AAAGTTAAAA CAGAGGCATT ATGTAATATA AGTTATTTAT ATAATTTAAC 122 52 0 

ATAACTTTCC ATATAACAAC AACGCATCTT TATAACATAA TTGCAATGTA TTCTGTTTTA 12 258 0 

AATTT AT AG C AATTACCTAA ATAGTAAAGA GTACGAATAA AGTACATATA GGCTATTAAT 12264 0 

TACTTTAAAA AAAATTAAAA TATATTTGTA GATTTATGGA CAATTACATT TATCCTTTAA 122 700 

AAGGTAGTTA ATCTAGAGAA ATTTTTTATA GCAATACAAC AAGTTTACAA GGAATTCAAA 12 2 76 0 

TATTTTTAAT TTTTAGAAAA AAACAAAAAC CCTAGCTTTT CAACTAGGGT TCTGTTTTTA 122 82 0 

TGTGATTTAG ATTTGTATCT GATAACCTGG CGGTGCCCTA CTCTCACATG GGGAAACCCC 1228 8 0 

ACACTACCAT CGGCATTACA GCGTTTCACT TCTGAGTTCG GTATGGTCTC AGGTGGGTCC 122940 

ACCGCACTAT CGCCGCCAGG ATAATTCTTT GATAACTCGT CTTTTCTCTT TTATCTCTCG 12 3 000 

TCTCTTTTAT CTCTCGTTAC TCAGTCTTTC CTCTTACCTT TCACTCGTTC TTATTGGTCA 12 306 0 

CAAGCTGAAT CCGATTTTCT TTCTATTAAG TCTTCTATAT TTACTTTTCT TTGTTTAGTC 12312 0 

TTTTACTTCT CTTTCGCTCT G CTTTT ATTT GCTTCGCTTT TCATCTCTTA CTTCACTTAG 123180 

CTCCCAAAAA CACTTGAGCG TTGTATAGTT AAGCCTCTCG GGCAATTAGT ATTGGTTAGC 123 24 0 

TCAATGACTC ACATCACTTA CACACCCAAC CTATCTACGT CGTAGTCTAC AACAACCCTT 12 33 00 

ACTGACTTAA AGTCAGGGAT GACTCATCTC TTGGCAAGTT TCGTGCTTAG ATGCTTTCAG 123 36 0 

CACTTATCTC TTCCGCATTT AGCTACCCGG CAATGCGTCT GGCGACACAA CCGGAACACC 12 342 0 

AGTGATGCGT CCACTCCGGT CCTCTCGTAC TAGGAGCAGC CCCAATCAAT CATCCTACGC 12 348 0 

CCACGGCAGA TAGGGACCGA ACTGTCTCAC GACGTTCTAA ACCCAGCTCG CGTACCACTT 12 3 54 0 

TAAATGGCGA ACAGC CATAC C CTTGGG AC C TACTTCAGCC CCAGGATGTG ATGAGCCGAC 12 36 00 

ATCGAGGTGC CAAACACCGC CGTCGATATG AACTCTTGGG CGGTATCAGC CTGTTATCCC 123 66 0 

CGGAGTACCT TTTATCCGTT GAGCGATGGC CCTTCCATTC AGAACCACCG GATCACTATG 12 3 72 0 
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ACCTACTTTC GTACCTGCTC GACTTGTCTG TCTCGCAGTT AAGCTTGCTT ATACCATTGC 12 3 78 0 

ACTAACCTCA CGATGTCCGA CCGTGATTAG CAAACCTTCG TGCTCCTCCG TTACG CTTTG 12384 0 

GGAGGAGACC GCCCCAGTCA AACTACCCAC CAGACACTGT CCGAGACCAC GTTTCGTCAT 12 3 900 

CTTCGTTAGA ACATCAAACG TTAAAGGGTG GTATTTCAAG GTCGCCTCCA CAATGACTGG 12 3 96 0 

CGTCACTGCT TCAAAGGCTC CCACCTATCC TACACATCAA AATTCAATGT TCAGTGTCAA 124 02 0 

GCTATAGTAA AGGTTCACGG GGTCTTTCCG TCTAGCCGCG GGTACACCGC ATCTTCACGG 124 080 

CGATTTCAAT TTCACTGAGT CTCGGGTGGA GACAGCCTGG CCATCATTAT GCCATTCGTG 12414 0 

CAGGTCGGAA CTTACCCGAC AAGGAATTT C GCTACCTTAG GACCGTTATA GTTACGGCCG 124200 

CCGTTTACTG GGGCTTCGAT CAGGTGCTTC TCTTTCGATG ACACCATCAA TTAACCTTCC 12426 0 

AGCACCGGGC AGGCATCACA CCCTATACGT CCACTTTCGT GTTTGCAGAG TGCTGTGTTT 1243 20 

TTAATAAACA GTTGCAGCCA GCTGGTATCT TCGACCGGTT CAACCTTCAG GGGCTAGCCC 1243 8 0 

TTACAATCTA CGCCGGCGCA CCTTCTCCCG AAGTTACGGT GCTATTTTGC CTAGTTCCTT 12 444 0 

CACCCGAGTT CTCTCAAGCG CCTGAGTATT CTCTACCTGA CCACCTGTGT CGGTTTTCAG 124 500 

TACGGTTTAG TAAAGCCTTT CGCTTAGTGG CTTTTCCTGG AAGTGTGGTA TCAGTTACTT 124 56 0 

CAGCTCCGTA GAGCCTCGTC ATCATCTCTC TGTGTTAAGG AAGTC CGGAT TTGCCTAAAC 12 46 2 0 

TTCCCACCTA C C AACTT AAA CGCACATATC CAACAGTGCG ATAACCTAAC CTACTCCGTC 1246 80 

CCCACATCGC AGCTTTACCA AGTACAGGAA TATTAACCTG TTTCCCATCG ACTACGCTTT 124 74 0 

TCAGCCTCGC CTTAGGAGCC GACTCACCCT GCCCCGATTA ACGTTGGACA GGAACCCTTG 124 8 00 

GTCTTCCGGC GAACGGGTTT TTCACCCGTT TTATCGTTAC TTATGTCAGC ATTCGCACTT 12 4 86 0 

GTGATACGTC CAGCCCACCT CTCGATGAAC CTTCTTCCGC TTACACAACG CTCCCCTACC 12-1920 

CAACAGAATA AATTCTGATG CCGCAGCTTC GGTGCTATAT TTCAGCCCCG TTACATCTTC 124 98 0 

CGCGCAGGCC GACTCGACTA GTGAGCTATT ACGCTTTCTT TAAATGATGG CTGCTTCTAA 12 504 0 

GCCAACATCC TAGCTGTCTA AGCCTTCCCA CTTCGTTTCC CACTTAATAT AGACTTGGGG 12 5100 

ACCTTAGCTG GcGGTCTGGG TTGTTTCCCT CTCCACGACG GACGTTAGCA CCCGCCGTGT 12 516 0 

GTCTCCTGAG TATCACTCTT TGGTATTCGT AGTTTGCATC GGGTTGGTAA GCCGGGATGG 12 522 0 

CCCCCTAGCC GAAACAGTGC TCTACCCCCA AAGGTGTCAC CTCAAGGCTC TACCTAAATA 12 52 8 0 

GATTTCGGGG AGAACCAGCT ATCTCCCGGT TTGATTGGCC TTTCACCCCC AGCCACAAGT 12 534 0 

CATCCGCTAA TTTTTCAACA TTAGTCGGTT CGGTCCTCCA GTTAGTGTTA CCCAACCTTC 12 5400 
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AACCTGCCCA TGGCTAGATC ACCGGGTTTC GGGTCTATAC CTTGCAACTA TTCGCCCAGT 12 5460 

TAAGACTCGG TTTCCCTTCG GCTCCCCTAT TCGGTTAACC TCGCTACAAA ATATAAGTCG 125520 

CTGACCCATT ATACAAAAGG TACGCAGTCA CAAGGTTGCC CTCGCTCCCA CTGCTTGTAC 12558 0 

GTACAAGGTT TCAGGTTCTA TTTCACTCCC CTCACCGGGG TTCTTTTCGC CTTTCCTTCA 12564 0 

CAGTACTGGT TCACTATCGG TCAATCAGGA GTATTTAGCC TTGGAGGATG GGCCCCCCTT 12 5 7 00 

CTTCAAACAG GATATCACGT GTCCCGCCCT ACTTCTCGTT AGCTTAGTAC CACAATATGG 125 76 0 

TTTTTAGATA CGGGATTATC ACCCTCTTTG ATTGTGCTTC CCAACACATT CTCCTCACTA 12 582 0 

TATTGCTATC ACTTACTGGC TCTTTCGCTT TCGCTCGCCG CTACTGACGA AATCTCGGTT 125880 

GATTTCTTTT CCTCGGGGTA CTTAGATGTT TCAGTTCTCC CGGTtTGCCT CAACAAGCTA 12 5 94 0 

TGAATTCACT TGTTGATAGT AGATTCTTCA TCTACTGGGT TTCCCCATTC GGATATCTTG 12 6 000 

GATTAAACGC CTCTTATCGA CTCATC CAAG CTTTTCGCAG ATTAGCACGT CCTTCTTCGC 126060 

CTCTGATTGC CAAGGCATCC ACCTTGTACG CTTAGTCACT TAACTATACA ACCTCAAATG 12612 0 

TTTTCAATCA CTTTACGTTT TCACTTCAAA GTGCGGTCAA TTTGATGCGT ATTTTCATTC 12618 0 

AACTAAACAC TTGATTGCTT TtGTTCAATC AAGATTTTTT TACAAAACAG GTTTTGTTTT 12624 0 

TAGAGTTTCC TTTTCAGTTC TCTCT t TCAC TTTACTGTTt TGCCTACTCA GACTTTCTCT 126300 

CGAAAATCTC TTGTTTTCAG CTTGTTTCCA ATTTTTTAAA GAACAGTAAG ACAATCTAAT 12 6 36 0 

AAAGTTATCT TTAATTGGCG TCCCCACGGG GATTCGAACC CCGGTTACCG CCGTGAAAGG 126420 

GcGATGTCCT AGGCCTCTAG ACGATGGGGA CATCAATTAA AGATACTCTA TTTTCTTTTT 12 6480 

ATCTTTTAAC GTTTATTTTT CAACGTTGCT TTCGCTTCGT TTTTAATCTT GTCTACAACT 126 54 0 

ATCAGCCAAT CTGTGTGAGC ACTCG CTGTC TCTCGTCTTC AGTAAGGAGG TGATCCAACC 12 660 0 

GCAGGTTCCC CTACGGTTAC CTTGTTACGA CTTCACCCCA GTCATGAATC ATACCGTGGT 12666 0 

AAACGCCCTC CAAAAGGTTA AGCTATCTAC TTCTGGTACA ACCCACTCCC ATGGTGTGAC 12672 0 

GGGCGGTGTG TACAAGGCCC GGGAACGTAT TCACCGCGAC ATTCTGATTC GCGATTACTA 12 678 0 

GCGATTCCGA CTTCATGGAG TCGAGTTGCA GACTCCAATC CGGACTTAGA CGTACTTTAT 12684 0 

GAGATTCGCT CCACCTCGCA GCTTCGCTTC CCTCTGTATA CGCCATTGTA GCACGTGTGT 12 6 90 0 

AGCCCTACTC GTAAGGGCCA TGATGACTTG ACGTCATCCC CACCTTCCTC CAGTTTATCA 12 6 96 0 

CTGGCAGTCT CCTTTGAGTT CCCGACCAAG TCGCTGGCAA CAAAGGATAA GGGTTGCGCT 12 702 0 

CGTTGCGGGA CTTAACCCAA CATTTCACAA CACGAGCTGA CGACAGCCAT GCAGCACCTG 12 7 08 0 
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TCTCTAAGTT CCCGAAGGCA CAAGCTCATC TCTGAGCTCT TCTTAGGATG TCAAGAGTAG 12 714 0 

GTAAGGTTCT TCGCGTTGCA TCGAATTAAA CCACATGCTC CACCGCTTGT GCGGGCCCCC 12 7200 

GTCAATTCAT TTGAGTTTTA ACCTTGCGGC CGTACTCCCC AGGCGGTCGA TTTATCACGT 12726 0 

TAG CTACGGG CACCAAGCTC TAAGCCCAAT CCCCAAATCG ACAGCGTTTA CAGCGTGGAC 12 73 20 

TACCAGGGTA TCTAATCCTG TTTGCTCCCC ACGCTTTCGC ACATGAGCGT CAGTACATTC 12 73 8 0 

CCAAGGGGCT GCCTTCGCCT TCGGTATTCC TCCACATCTC TACGCATTTC ACCGCTACAC 12 744 0 

GTGGAATTCT ACCCCTCCCT AAAGTACTCT AGTTACCCAG TCTGAAATGC AATTCCTAGG 12 7 5 00 

TTAAGC CCAG GGCTTTCACA CCTCACTTAA ATAACCGCCT GCGTGCCCTT TACGCCCAGT 12 756 0 

TATTCCGATT AACGCTCGCA CCCTCCGTAT TACCGCGGCT GCTGGCACGG AGTTAGCCGG 12 762 0 

TGCTTCTTCT GTATTTAACG TCAATTTGAT GTACTATTAA CACATCAACC TTCCTCAATA 12 76 8 0 

CCGAAAGAAC TTTACAACCC GAAGGCCTTC TTCATTCACG CGGCATGGCT GCGTCAGGGT 12 77 4 0 

TCCCCCCATT GCGCAATATT CCCCACTGCT GCCTCCCGTA GGAGTCTGGA CCGTGTCTCA 127800 

GTTCCAGTGT GGCTGGTCAT CCTCTCAGAC CAGCTAGAGA TCGCAGGCTT GGTAGG CATT 12 786 0 

TACCCCACCA ACTAC CTAAT CCCACTTGGG CTCATC CTAT GGCATGCGGC CTCTCAGTCC 12 7S20 

CGCACTTTCA TCTTCCGATA ATACGCGGTA TTAGCGACAG TTTCCCGTCG TTATCCCCCT 12 79 8 0 

CCATAAGCCA GATTC CCAAG CATTACTCAC CCGTCCGCCA CTCGTCAGCA AGAAAGCAAG 12 8 04 0 

CTTTCTCCTG CTACCGTTCG ACTTGCATGT GTTAAGCCTG CCGCCAGCGT TCAATCTGAG 12 8100 

CCATGATCAA ACTCTTCAAT TCAAGTTCAA TCGCTCAATA AACTG CTT AG CTATAAATAA 12 816 0 

AATACTACTT AAAAAAGTAA TAATGAATTT TCAGTTAAGC AC CTATTAAG ACTTCAAAAT 12822 0 

TAAAAATAGT TTTAAAACAA GTCAATCAAC AAGTGCCCAC ACAGATTGTC TGATATATTG 12 82 8 0 

TTAAAGAGCA AAATAAAGTG GTCGGCGAGA TAGGATTTGA ACCTACGACC CACTGGTCCC 12 8 34 0 

AAACCAGTTG CGCTACCAAG CTGCGCTACT CGCCGTCAGA TCTTATAAAG ATAAGATGGG 1284 0 0 

GTGGCTAATG GGATTCGAAC CCACGACAAC TGGAATCACA ATCCAGGGCT CTACCAACTG 12 8460 

AG CTATAGC C ACCATTAAAA GTTGTTACTG CATTGACCTG GCGCGCTCGA CAAGATTCGA 128 52 0 

ACTTGCGACC TTTGGCTCCG GAGGCCAACG CTCTATCCAA CTGAGCTACG AACGCGTTAC 12 858 0 

TGCGTCGTTG CGGAGGCGTA TATTAATGAT TTAATAAAAC CTTGTCTAGC ACTTTTTGAA 12864 0 

AAAAAATGTT T GAG CG AT AG TTTTTTATTC AGTTTGTTTA AAATACACTA TTTTTTCGAC 128700 

TATTTTTACA ACTTGGCGAA TCATTATATA TGTgGAATCA TCCAACAATT ATGAATTTGT 12876 0 
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TTATTACGTT CAAAATCTAG CGGTAACGTT TTATGCGAAA TTTCTATTGC ACTTAAG CCT 128820 

AATTCTTCCA ACGCAACTAA ATTCATTTTG AATCCACGTT TATTATT CGA AAAAACAATT 12 8880 

GTGCCATTAT TCGACAAAAC TCGTTTTAAA TTGCGCATTA ATTTCACATG ATCTCGTTGC 12 8940 

ACATCCCAAC TCTCTTCCAT ACGTTTAGAA TTCGAAAACG TCGGTGGATC CGCAAATATC 12 9000 

AAATCAAATT GACGATCACA TTTTTCTAAC CATTGAAGGC AATCTGCTTG AATTAATTTA 12 906 0 

TGTgGTTTAC CTTCAATATC ATTCAAAATA AGATTTTGCT CCGCCCAATT AAGATAAGTA 12 912 0 

TTTGACATAT CAACTGTAGT TGTCGATTTA GCGCCACCAA GTGCTGCATG AACAGTGGCA 12 918 0 

GAACCTGTAT AAGCAAATAA GTTCAAGAAA TCTTTACCCT TTGCCAACTC ACCAATCATT 12 9240 

TTTCTTGTTA AACGGTGATC TAAGAATAAT CCAGTGTCTA AATAATCTGT CAAATTTACC 12 93 00 

CAAAGTTGCG TGCCATATTC ATTCACATAA AAGTATTCGC CTTTATTGGC TAACTTCTCA 12 93 60 

TATTGATTTG TAC CTTTTTG TTTCTGACGA ACTTTGAGGA TCAACTTATT CGTTTCCACA 129420 

CCTGTCACTT GCAAAGTTGC AGTAACGGCA TCTAACAAAC GCTGACGAGC TTTATTTTCA 12 9480 

TCAATATTTT TCGGCGCTGC ATACTCTTGC ACAACAATGT AATCTGCATA ACGATCAACA 129540 

G CTAAATTAT ATTCTGGCAA ATCAGCATCA TATAAACGAT AAGCGTCTAA TTCTTGTTGC 12 96 00 

TTTGCCCATT TGCTAATTTT CTTAATATTT TTTTGTAATC GATTAGCAAA ATCGGGGGCG 129660 

ACCTCCGAAG TGCGGTTAAA TTCTAGTTCC TTTTCATCTG TAATTGCGTC AGATTTTCGT 12 9 72 0 

TCTGAAACTT GATAATTCTT TTGCACACAA TCTAGTGGGC CATTTTTTGC TTTAAACTGT 12 9780 

CGGCTGGCAC GCATTCTCAA ACAATCAAGC AATGTGGATT CGCTACTAAA AACAGAAACA 12 98 4 0 

TTCCAACCGC AAAATTCTTT TTTAAGTCTT TGCCCAAATA CAGAATAGAG CGCGATTAAA 12 9900 

GCTGGTGTTG TTCCTAAACG TTCGCCATAA GGTGGATTAC AGATAACCGT ACCGACTTCA 12 996 0 

TTTAAACGAG GATTTTTTAA CGCTGCCACA TCAGCTTGCT GCCATTTAAT TAAATGTGAA 13 002 0 

ACACCTGCAT TTTGTGCATT TTTCTGCGCC TTTTTTAATA CGCGGTGATC GAGGTCAAAT 13 008 0 

CCATAAAAGT GCGGTTGAAT TTCACGTTGT TTTTCTTCTG CTAATTCAAT CGCTTCATTT 13 014 0 

TTTACTTTTT CCCAAGCCGA TTGATTATGC GCTTTCCAAA AATCAAATCC CCAATGTAAA 13 0200 

CGATAAAGTT GCGGCGCAAT TTTGGCTTCC ATTTGTGCCG CTTCAATTAA AAGCGTACCT 13 0260 

GAACCACACA TTGGATCTAC TAATGGAGAA CCCACTTGCC AACCTGAACG CATTAACAAT 13 0320 

TGCCGCCGCG AGAGTTTCAC GTAAAGGCGC TTGACCTGCA TCTTCACGAT AGCCTCGCAA 13 03 80 

ATGTAATGCT TCCCCACTTA AATCTAAAGA AATCACTAAA TTTTCACGAT TCAAATAAGC 13 044 0 
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ATGGATACGT ACGTCTGGCT GAATTTTATC TACATCAGGA CGAGTTTTAC CTTGACGTTC 13 050 0 

AAAGTAATCG ACAATTCCAT CTTTAACGCG CATCGCACCA AATTGAGTAT GACGAATTTC 13 056 0 

TTGGTTGGTT CCATTAAAAT CGACAAAGAA TGTCACTCGT TCATCAAATT GTGCTAACCA 13 0620 

ATTAAAGCCT GAAACAGCCG C AT AAAG AT C CAAATCACTG TAAATTTTGG TTTCAATCAA 13 0680 

AGGAAATAAA ATGCGTGAAG CAAGGCGTGA CCAAAGCAAA GTGCGGTATA AAGTTTCGTC 13 074 0 

ATCAGCTTGA TAATGAACGC CACCTTGAAC CACTTTTGCC TCTTGCGCAC CCAGTTCAGT 13 0800 

GAGTTCAACT TTTAATAATT CTTCAAAGCC ACGAGAAGTC GTTGCAAATA ATTGTTTCAT 13 0860 

TTAGGT AC CG ATTTTGAAAA TTTTGAGGGA TTATAGCATA GAAAAAGATG ATCTAATTGA 13 092 0 

GATCATCATA GCGAGTTAAA CTGTAAAAAG AAAAGCCCTG ATTTTCTTAA ATCAGGG CTT 13 098 0 

TAGAATTTGT TGGTTGCGGG GGCCGGATTT GAACCGACGG CCTTCGGGTT ATGAGCCCGA 13104 0 

CGAGCTACCA AGCTGCTCCA CCCCGCgtCT GATGGGGCGT ACTATACTCG CTTAAAATTG 131100 

AaTTGCAAGA ATTAAACAAA ACTTGATGTA tTGTTTGTTA ATAAAATGTT CAATATG TGG 13116 0 

TTTTATTAAT CAACTTCGCT AAATAACACT GGTTGTTTTC AGTAAATTTT GATAGCGTAT 13122 0 

GCAGCCCTAT TGTrGCTTAC TTTAGAATTA AAAAATGTCG ATGCTTAAAC CTTTCTGGTT 1312 8 0 

CAAAACTTTT TCTATCTCAA TTATT AC CGC ACTTTTGGTA GCTTGTACCT CTAACACAAn 1313 40 

AAACACTCAG ATTC CAACCn CCTCAAATGG GAGTGATCCT CAACAATTCG GTGTG CAAAA 1314 00 

TATACCAATC GAACTTATCA GCAAACCGCT CTTGTGCCTG TTTC CTATAT AGAAAACCAA 1314 6 0 

AGTG CGGTAA TAAATCAAGG CGATTTTTTA ACACAGCTTT C CAAT AT AAA AAATTATTCA 13152 0 

AGTAAACTTT CCACAAATTT TTACGATAAT TATGAAAAGA ATTACAAATT GGGTTCTTTC 13158 0 

TGGGGCAAAT ATTAATGAAC TCACTCAATT TAATATCCAA CCACAAATTA TGCGTGGCTT 13164 0 

TGATGGATTT CAAAATGTGC TGATGACAGG CTATTATTCG CCTATACTTT ATGCTCGTCA 13170 0 

TTCCCCACAA GGTCAATTTA AAAATCCAAT TTATCGTATG CCTGTAAAAA AACGTCTAAG 13176 0 

TCGAGCACAA ATTTATGCGG GAGCATTAGC GGAAAAAGAT TAG AGTTAG C ATATAGCGAT 1318 2 0 

TCAATGTTAG AAAACTTTTT ACTTGGTGTA CAAGGCAGTG GCTATGTAGA TTTTGGCGAT 1318 8 0 

GGCAATCTTA ACTATTTTGC TTACG CAGGA CAAAATGGTT ACCCTTACAC GG CTATCGGG 13194 0 

CGTTTATTAG TAGAAGATGG CGAAATT CC A AAAGAAAAAA TGTCTATTCA AGCAATTCGA 13 200 0 

GAATGGGGTA ATCGTAATCC CTCTCGTATA CAAAGCTTGT TAGAACGCAA TGAAGCTTAT 132 06 0 

GTATTCTTTA AAAATGATCC AAGTGGCAAA GTGAAAGGCT CTGCGGGCGT TCCTCTTGTA 13212 0 
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ATTCATTCTT 


AGACAAACCA 
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AAAAGTGATT 


ACCATTGCCG 
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ATCTTGCGGA 


TGTAAAACCT 


TTATTAAGTA 


AAGCTCACCA 


TG AG CATTT C 


CATGAAGATG 


133680 


GCGATCATGA 


T C ATG AC CAT 


AAGCACGAAC 


ACAAACATGA 


TCATAAACAC 


GACCATGACC 


133740 


ATGATCATGA 


TCATAAACAC 


GAGCATAAAC 


ACGACCACGA 


ACATCATGAT 


CACGATCATC 


133800 
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ACGAGGGTTT 


AACAACAAAC 


TGGCACGTTT 


GGTATTCTCC 


' AGCTATCAGC 


1 AAAATTGTTG 


133860 


CACAAAAAGT 


AGGGGATAAA 


. TTAACTGCAC 


AATTCCCAGA 


. TAAAAAAGCG 


TTAATTGCAC 


133920 


AAAATCTTTC 


AGATTTTAAC 


CGCACTTTGG 


CAGAACAAAG 


TGAAAAAATT 


ACGGCACAAC 


133980 


TTGCAAATGT 


TAAAGATAAA 


GGTTTCTACG 


TTTTCCACGA 


TGCTTATGGT 


TATTTCAACG 


134040 


ATGCTTATGG 


TTTAAAACAA 


ACGGGTTACT 


TTACCATCAA 


TCCATTAGTG 


GCACCGGGTG 


134100 


CAAAAACTTT 


AGCGCACATT 


AAAGAAGAAA 


TTGATGAACA 


TAAAGTAAAT 


TGCTTATTCG 


134160 


CAGAGCCTCA 


ATTTACG C C A 


AAAGTGATTG 


AGTCTTTAGC 


GAAAAATACT 


AAAGTCAATG 


134220 


TAGGACAACT 


CGACCCAATT 


GGCGATAAAG 


TTACTTTAGG 


TAAAAATTCT 


TATGCAACAT 


134280 


TCTTG CAATC 


TACTGCAGAT 


AGCTACATGG 


AATGTTTAGC 


TAAATAAACC 


TTTCTAAAAA 


134340 


CAATGACCGC 


ACATAAAAGT 


GCGGTCATTT 


TTTTTTC T AA 


ATTTGATATT 


TAGTTTTATC 


134400 


TCCAATATCA 


AAATGCAAAG 


GCATTCGCCA 


TTTTTGTATC 


GCTAAAACTA 


AAAGTAATAC 


134460 


GCCAGAAATA 


ATAGAAAAAA 


TGCGTAGTGC 


AATAAGATTA 


TTTAACACCC 


ACGAAAACCA 


134520 


CAAGCAAGCC 


ACAAATATCG 


GCTGTAAATT 


TAGCAAAGGC 


ACGCGGAAAC 


AAAAATATTC 


134580 


TTTAAAACAC 


AAACCACCTA 


AAGTAAGTAA 


CGCACCGCCA 


AGAGCCAACG 


CAGGATAATG 


134640 


ATTAATATAA 


AAAAGTAGTC 


CAATC CAAGT 


TG C AAATTG A 


AAAAG CAATC 


GAAAAGATTT 


134700 


TAAGTAAATG 


TGTAACGAAG 


AGGCACAACA 


GGTTGCCGCA 


ATTAATATAA 


ATGTGAGTGA 


134760 


TAAATCTGGA 


TACCAACTCA 


CTGTAAGAAC 


AGCCAACGCC 


GCTATCACAA 


AACCAGTTCG 


134820 


GTAAATAATA 


ACAGTCAGAT 


AATCTAAAAA 


ATCCATCGGT 


GATTTAATAT 


GTGGATCAGC 


134880 


CATTGTATAA 


ATGTTCCTTT 


TTATGAATTA 


CAAATTAAAA 


ATAAATATAT 


CCTAAAAGTC 


134940 


AGAAATAAAA 


AGTACAGTCA 


ATTTAAAAAC 


AGGTTCTAAA 


TTAACCGCAC 


TTTTATTATT 


135000 


TCAGTTTTAG 


CTTATCAAGA 


ATTTTTTGAT 


GAATGCCACC 


AAAACTGCCG 


TTAGACATCA 


135060 


CTAAAATATG 


ATCCGTAGGT 


TGAGCCTTTG 


CAACAATCAT 


ATCCACTAAT 


CGATCTAAAT 


135120 


TTGCATTCCA 


ATAAGCAGGT 


TG T AC AC ATT 


GGTTTGCAAT 


ATCTGCCACT 


TCCCAAGAAA 


135180 


GTTGTTCAGG 


TTGAAGCATA 


AACACGGCAT 


CAGCTCGACC 


AAGTGCAGGC 


GCGATGTCAT 


135240 


CTTTATGAAC 


ACCCATTTTC 


ATTGTGTTAG 


AACGAGGTTC 


AAGCACTGCA 


AGAATACGAA 


13 5 3 00 


CACCACCACC 


AACTTTATCT 


CGTAAAGCGG 


TTAGCGTAGC 


AAGAATAGCT 


TCTGGATGAT 


135360 


GCGCAAAATC 


ATCATAAACC 


GTAATGCTAT 


TAACTTCGCC 


TTTCACTTCT 


AAACGACGTT 


135420 


TAGCATTCAC 


AAAAGATCCC 


AACGCTTTGC 


ACGCATCTTC 


AATAGCAACG 


CCAGTATGAT 


135480 
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GAGCGGCAGC TATTGCCATT AACGCATTAT GCATATTATG TTGTCCCACT ACATTCCATT 13 554 0 

TTACTTCTGC CACTTTTTCT CCATGATGGA AAACCGCAAA ATGTGAAGCA TCATTTGTAA 13 5600 

TACGTTCAGC AAACCACTCG TTATCTTTGC CTAAAAATTG CTGTTGAGAC CAACATCCCA 13 5660 

ATGCCAAAGT TTCTTTTGCA CTTTGTTCAC TCGCCGAACT TAATACCAAG CCACTGGCTG 13 5720 

GGATAGTTCG AATCATATGA TGAAATTGGC GTTGGATCGC TTTGAGATCA TCAAAAATAT 13 5780 

CAGCGTGATC AAAACTAATA TTGTTTACGA TAAGTGTACG TGGATTGTAA TGGACAAATT 13 584 0 

TCGAACGTTT ATCAAAGAAA GCTGTATCAT ATTCATCAGC TTCAATAATA AAATAAGGGC 13 5900 

TATCTCCTAA ACGAGCTGAA ATGCCAAAAT TAC CCGCAAT TCCGCCGATT AAAAAACCTG 13 5 96 0 

ATTTTAAGCC ATTTTGTTCT AAGACCCACG TTAGC AT AC C TGTGGTCGTT GTTTTTCCAT 136 02 0 

GCGTACCTGA AACCGCCAAT ACCCAACGAT CGCGCAATAA ATGATCATGT AACCATTGAG 13 608 0 

GGCCAGATGT GTATTTCAAA TTATTTTCCA ACAC AT ATT C TACGCAAGGA TTACCACGTT 13614 0 

TCATCGCATT ACCAACAATT ACCATATCAG GCGCAGGTTG T AATTGTG C A ACATCATAAT 136 2 00 

TTGGAATAAT CTCGATACCT TGCTCTTCTA AAAAAGTGCT CATTGGAGGG TAAACATTAG 13 6260 

TATCGGAACC CGTTACGTGA TAG CC C ATTT GTTTAGCAAT CATCGCGACA CCGCCCATAA 136 32 0 

ATGTGCCACA AATAC CTAAG ATATGAATAT GTTTCATTGT ATTTCTCTGT TGATTAAGGA 13 63 80 

AAACGAAAAA TATGATAAAA CAACTTCTAT GAACTTTCAA AAGAAAAGAC CATCATACCA 13 64 4 0 

AACTATATGA TCTATAATTA ACGCCACTGA ACTATTATTT AACCACTTCA ATAGGAATAA 136 500 

ACAATGAAAT TTAAGCGATA AAGCAATTTC AGAAGCACAA TGGCTCACTA TTTACAAAGA 13656 0 

AAATAATCAG CCGTTCTATG TAG CAAATCT TCCTGCATTA CAATTTGGAA AAAGTATTCA 13662 0 

AGTCGATATT ATTGGGACTT CAATTCATTT TGCTGACGGT AC CCAAATTG GTTTAGTTTC 13 6680 

AGAACAAAAA TAATCCTATA AAAAATGCCT GTGTAAAACA GGCATTTTTT TATCCGCACT 136740 

TTTAAGCAAT TCGCTCAAAA CCAGCTTTCA AATCTGCAAT TAGTTCATCC ACATTTTCAA 13 6800 

AACCAATATG CACACGAATT AATGAACCTG TAAGATTACG TTTAATATTT GGACGAATAT 136 86 0 

GTGCAATTTC TTCTGGTTGA TTACAAAGAA TTAGAGACTC AAATCCTCCC CATGAATAAG 13 6 92 0 

CCATAGCGAA CAGTTGAAAA TGATCCATAA ATTTTGATAC TTGCTCTGAA GTTAATCGCT 136 98 0 

GGGTTAATTC AAAAGAAAAT AAACCACTGC TTCCACTAAA ATCACGTAAA AAGAATTCAT 13 704 0 

GACCAGGGCA ACTTGGTAAA G CTGG ATGGT ACACAGTTTT TACTTCTGGT TGCTCACTCA 13 710 0 

ACCATTTTGC CACTTTAATA CTGCTTTCCT GATGTTGTTT TAATCGTACC CCCAATGTAC 13 716 0 
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GGATACCACG AGCAGTCGTA TAAGCGGAAT CTGCATCAAC CATTTGCCCC ATTAAATAGG 13 722 0 

AATGTTCTCT AAGTTGATCC CAAGTTCGCG CATTCGCAAC GGCAGTACCA ATCATAATAT 13 7280 

CTGAATGACC CACTAAATAT TTCGTACCCG CCTGAATAGA AATATCAATA TCATGCTCTA 13 7 34 0 

ACGCTTTAAA TAGAACACCT GCCGACCAAG TATTATCAAT CATAATCACA ATATTAGGGT 13 7400 

TCACCTTTCT TGCAGCCTTC ACAATAGTCG GAATATCGGG AATTTCCATT GTAATAGAAC -13 7460 

TCGGTGCTTC TAAGAAAAGT ACTTTTGTAT TCGGTTGAAT AAG CGTTGC A ATATCTTCAC 137 52 0 

CAATAAGAGG ATCGTAATAA GTTATATCGA TCTGCATTTT TTTCAGCACA ATATTACAGA 13 7 58 0 

AATATTGGGT TGGCTCGTAA GCTGCGCCAC TCATTAGCAC ATGATCGCCA GTTTTAACAA 13 7640 

AAGAAAGAAT CGAATTAGTT ACTGCTGCTG TACCACAAGG ATAAAGATAA CAACCTGCCC 13 7700 

CCCCCTCCAT TTCGCACATT AAATCTTGTA AAGCAAAATG TGTTAGCGTC CCTCTTCTGC 13 7 76 0 

CATAAAATAA TTCGCCTTTT GCTCTACGTT GAGTAGCGTG TTTTTTATCT TCAATACTGT 13 782 0 

CAAATAGTAA TG AAG AAG C T CGCTGTAAAA C AG GGTTT AC CGATCCTTGT GAAAAACGTT 13 7 8 80 

TTTGACGACC AGAATGAATA AACATTGTTG AAAGATCATA TTTTGTTTGC ATATATTTCC 13 7 94 0 

TTATTTAAAT TAACTTATAG GGCAAAAATA ACTATGGTTT CAATTCCTAT CCGCACCTGA 13 8 0 00 

TTTTACTTCA TCTGCGGTAT TAGGCAATGG AAAAAATAAA GAGAAAACAA CAAATAATTC 13 806 0 

TGACGTCTAC ACAGAATCCC TAAAATAGAT GCTAACGAGC TTTAATTTTA TCAATTATTT 13 812 0 

ATAAAAAGCG ACCTTAGGTG GCTTTTTTTA TCATCAGAGA TAATTTAGAA TAAACCATCT 13 818 0 

TTAGCTAAAG TGCGGTCAAT TTTTCTATGA AATTCAATAT CCCAATTTTC CTAACGATCT 13 824 0 

TTCGAGTCAT TTTGATTCCT TTTTTCGTGA TTGCATTTTA TTTACCCATT GAATCATCTC 13 8 300 

CTTTCATCAC AACG CTCATT TTTTTTATTG CAGGTGTCAC TGACTGGCTA GATGGCTATC 13 8 36 0 

TAGCTCGTAA ATGGAAACAG ACCACGCGCT TTGGTGCATT TCTTGATCCT GTTG CTG AT A 13 84 2 0 

AAGTAATGGT AGTGG CTGC A CTAGTTTTAA TTGTCGAACA TCAACATACA TTTTGGATAA 13 8480 

CCATTCCTGC CATTATTATG ATTTCTCGTG AAATTATCAT CTCCGCATTA CGTGAATGGA 13 854 0 

TGGCAGAGCT TGGGGAACGT AGCAAAATCG CAGTCTCTTG GTGGGGAAAA TGGAAAACCA 13 8 6 00 

CAGCGCAAAT GTTAGCACTA GGCGGATTAT TATGGCGTTA TAACAACTAT ATGGAAATAG 13 8660 

CAGCAATTAT CTTACTTTAT ATCGCCGCTA TTTTAACTAT ATGGTCAATG ATCCAATATT 13 8720 

T AC AAG TAG C AAAAGGAAGC TTGCTAGATA ATATTCAATT ATAAATTTAA ATAAAACACA 13 878 0 

GTTATTTTTT ATGATAACCA TCTATATTTG AGTTATTTTT AATCAAATAA AATATTTTTT 13 884 0 
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ATTTTTTTCG TTTGACAGCA ATATATAAAT CCGTAAAATA CGCCCCGTTG TCACGCAGTA 13 8 900 

GTGCTGATTA GTGATAAATG CGGGAATAGC TCAGTTGGTA GAGCACGACC TTGCCAAGGT 13 8960 

CGGGGTCGCG AGTTCGAGCC TCGTTTCCCG CTCCAATTTG CCCGAGTGGT GGAATCGGTA 139020 

GACACAAGGG ATTTAAAATC CCTCGCCTTT CGAGGCGTGC CAGTTCAAGT CTGGCTTCGG 13 9080 

GCACCATTTA AACACCAATT CAAAGATTTT AAAATTTATT TTAAAATCAT CTAATTGGGT 13 914 0 

CGTTAGCTCA GTCGGTAGAG CAGCGGACTT TTAATCCGTT GGTCGAAGGT TCGAATCCTT 13 9200 

CACGACCCAC CACTTAAAAA TAAAGTAAGC ACCGAAAGGT GTTTTTTTAT TATCTAAATC 13 9260 

TCTGAAACTC ATATAAAAAA TCCCCGACTC AAAGAATCAG GGACTTTTAC AATCAAACGT 139320 

CTTTAATTAA TTAACGCTCA TTCCAACGTT TAATAGACTC ACGAATCACT TCTTTCGCTT 13 9380 

CTTCTACATC GCCCCAATGA GTTACTTTCG TTGAGCCAGG TTTCTTAAGC GCTTTATAGT 13 9440 

TATTGAAATG GAACTCAATT TGTTTAATTA AGTTTGCCGG TAAATCTGCT AATGAATTGT 13 9500 

AAGCATTTCC TGTATCACGA TCATCTGCAG GAACACAAAC GATTTTATCA TCCACTTCGC 139560 

CATCATCAAC AAATTTCATG ACACCAATTA CTTTAGCCTC AAGGAATACA CCTGTTGCTA 13 9620 

ATGGTTGGCG AGTGATTAAT AACACATCTA ACTCATCGCC ATCTTCATCT AAAGTTTGTG 13 9680 

GAATGAAACC ATAGTTAGTT GGTTTTGCAA AAATTGCTGG CTCAACACGA TCTAACTGGA 13 9740 

ATGCCGCAAC TTTACGGTTC CATTCGATTT TATGGCAGCT ACCTTCTGGA ATTTCATTTA 13 9800 

CTACATTGAT AATGCCAGCA TCTACATCGC CTGGCGTTAA AATTTGATTA AAATCAGCCA 13 986 0 

TTTTGTTTCC TTCATTTTAT CAAGTTAAAA AACGCAACAA GTATAGCAGn TTTTCTGCGA 13 9920 

ACTCGCAGTC TCATTTGTAG AGAAAATCAG CAAATAGATC CATCTCATAA CTATTCATAA 13 998 0 

TTTTGACCGC ACTTTATCCT AAATAAAACT CACGCAAACG TTTACTTATC GTTTTTATGA 140040 

TGTAAAATAG CGCACCTTAT TACTTCCAAT TTTTTACTTA ACTTCAAGGA ACACTATGCC 14 010 0 

AACTTTAGAA AAAACATTTG AGCTCAAACA ACGTGGTTCA ACTGTTCGTC AAGAAATTAT 14 016 0 

CGCGGGTTTA ACCACTTTCT TAGCAATGGT TTATTCTGTC ATCGTGGTAC CCAATATGCT 14 0220 

TGGTGCTGCA GGTTTCCCTG CGGAATCTGT CTTTATTGCC AC CTGTTTAG TTGCAGGATT 14 028 0 

AGGCTCTATC TTAATCGGCT TATGGGCAAA TGCACCAATG GCAATTGGTT GTGCAATTTC 14 034 0 

TCTTACTGCT TTTACAGCAT TTAGCTTAGT GATTGGTCAA AAGGTCGCCA TTCCCGTnGC 14 0400 

GCTAGGCGCA GTATTCTTAA TGGGGGTTGT GTTTAC CTTA ATTTCTACCA CAGGTATCCG 14 0460 

TGCATGGATC TTGCGTAATT TACCATCCAA TATCGCTCAT GGAGCAGGCA TTGGTATCGG 14 052 0 
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ACTCTTTTTA CTTTTAAXXG CCGCAAACGG TGTTGGTTTA GTCGTCAGCA ACCAAGCTGG 14 058 0 

CTTACCAGTA AAATTAGGCG ATTTCACCTC ATTCCCAGTC ATGATGTCCT TAATTGGCTT 14 064 0 

AGCGTTAATT ATTGGCCTTG AAAAAATGAA AATTAAAGGC GGCATTTTAT GGGTAATTAT 14 0700 

CGCTATTACT ATCGTAGGTT TAATTTTTGA TCCAAATGTA AAATTTGGCG GAG AG ATTTT 14 0760 

CAAAATG CCA ACCTTTGGTG AAAATTCTCT TTTCTTACAA TTAGACTTTA TGGGCGCACT 14 0820 

ACAACCCGCA ATCTTGCCTG TGGTATTTGC TTTGGTGATG ACTGCCGTAT TTGATGCAAC 14 08 8 0 

CGGTACAATT CGTGCAGTTG CAGGTCAAGC AGACTTGCTT GATAAAGATG GTCAAATTAT 14 0940 

CAATGGCGGT AAAGCCTTAA CTTCAGATTC AATCAGTAGC TTATTCTCAG GCTTATTCGG 141000 

TACAGCACCA GCTGCCGTTT ATATTGAGTC AGCAGCAGGG ACAGCAGCAG GCGGTAAAAC 14106 0 

AGGTATTACT GCTATCGTTG TTGGTG TGTT ATTCCTATTA ATGCTTTTCT TCCAACCATT 14112 0 

AGCTTTCTTA GTGCCAGGCT ACGCAACTGC TCCAGCATTA ATGTATGTTG GTTTATTAAT 14118 0 

GTTAAGCAAT GTAAGCAAAT TAGATTTCGA TGATTTCGTT GGCGCGATGA G CGG ATTAAT 1412 4 0 

CTGCGCAGTG TTTATCGTAC TTACCGCGAA CATCGTAACA GGCATTATGT TAGGTTTTGC 1413 00 

GGCATTAGTA ATTGGAAGAA TCGTGAGTGG CGATATTAAA CGcTTAAATG TAGGTACTGT 14136 0 

GATTATTGCA ATCGTACTTG TTGCATTCTA TGCAGGCGGT TGGGCGATTT AGTAACAGAA 141420 

ATAAGAAAAG GGAAAAGCCT TCCCCTCAAT TTTAATCACA AAAGTGCGGT GAAATTTTAC 1414 8 0 

CGCACTTTTT TATACTTGGA TTACTCTTTC TTCAATAAGA AAATACCGTG GAT C AG AT AA 14154 0 

ATTTACATAA GCTTGTTTTA AAATTGGCAT TAAAATCCCT CTGGTTTACA ATTTACCAGT 1416 0 0 

AAATCTTTGC CCCGCCCAGT TTGCAACCAC TTCCCCAATG GTTGCCCCAT ATAAACCGCA 141660 

CTTTTAATCT CGCAACGTTG TGCATCGCTA CCCTCTGCAG CAAGATAAAT TGCTTCTGGG 14172 0 

CGAATACCCA CTAAACATTC GCCATCGGGT AAATTAAACT GTGCAGCATC TTTTAAAGGC 14178 0 

ACAGAGTAAC CATTAATATC CGCCACGCCA TTTTCTAATT TTCCATCGAA AATACTGGAT 14184 0 

TCGCCCATAA AGTTAGCCAA AAACAGAGAA TTTGGTCGGA GATAAAGCTC TTTCGCCGGC 1419 00 

GCTTTTTGCA TAATTTTACC TTTATTCATC ACAATCACTT CATCAGATAC CGCAAATGCC 14196 0 

TCTGTTTGAT CGTGTGTCAC ATAAAGCGAA GTAATGCCTA AACGTTGTTG CAATTCACGG 14 2 02 0 

ATTTTTTCAC GCATAGAACG ACGTAAGTTT GCATCTAAAT TACTTAATGG TTCATCAAAC 14 20 8 0 

AATAGGACTT TTGGTTTCAA TACCAAAGCA CGAGCCAATG CGACACGTTG TTGTTGCCCA 14 214 0 

CCAGAAATTT GATCCACGAA ACGATCTTCA AAGCCCGCTA AATCCACTAA TTCTAAAGCC 1422 00 
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TCTTTTACAC GCTGAGCGCG TTCTTCTTTG CCAATGCCTT GCATTTTTAA GCCGTAGCCC 142260 
ACGTTATCGC CAATGCTCAT ATGCGGGAAA AGCGCGTAAG ATTGGAAAAC AATACAAATA 142320 
TCTCGATTCT GAATAGAGGA TTTTGTTACA TCTTCGCCAT CAATAAATAT TTGACCTGAT 142380 
GTTGGATTTT CTAATCCTGC CACTAAACGT AATACGGTGG TTTTACCACA GCCTGATGGC 142440 
CCTAACAATG TTACCATTGT GCCACGTTTG ATCG TTAAAT CTAAATTATC AATGACGACC 142500 
GCTTTACCAA ATGCTTTTGT GATATTTTTT AATACTAAGA AATCATTATT ACTCATAATT 142560 
TTTACCTACA ACTTAACGAG TTAATTCATT TTTTTCGCTT TAGAACGGGA AATACGCGTA 142620 
TCGCCTACAA TCCAGTCGAA AAACAAAATA ATTGCCATCA TCACAACAAT TAAAATAGAA 142680 
CCGTATGCAA TCGCAATGCC GTATTCTCCG TCTTCCACAC GATTTAAAAT ATATG CGGTA 142740 
GACGCGAGTn TCTGCTGTCA CTAAGAATAT AATCGCACTT ACCGTCGTCA TTGCACGAAC 142800 
GAAACTAGTC ACTAAAGCAG AAAGCAACGC AGGTTTTAAT AATGGCAATA CGATAAAGCA 142860 
CAATGTTTTC CACGAACTCC CTTTTAAAGA AAGCGATGCT TCATCAAGGG ATTTATCAAG 142920 
CTGTCCTAAC CCAGCAATCG CTGCTCGCAT ACCAATGGGT AAATCACGCA TTACCATTGA 142980 
AATAATCACG ATAATGCCCG TACCCGTAAT ATAAAGCGGT GCATTGTTGA AGGCTAAAAT 14 3040 
GTAAGACACC CCAGCCACAG TACCTGGCAC GGCAAAACAA AGCATAGTTA AGAACTCAAG 143100 
GGATTTTTTA CCTTGGAAAT CTTTACGCAC CACAATATAC GCAATCAATA ATCCAAAGAA 143160 
TGCAGTTAAT GGCGCAGCAA TACCCGCATA GATTAAGGTA TTGATTAGTG ACGGCCACGC 143220 
TCCATCGCTC AATCCTTGCC CGAATAACAT GGCATAGTTT TTCAAGGTTA AAGTGTAATT 143280 
TACGCCCCAG TTTACAGTGA AACTTCCGTA GAAAATACTA CCGTAAAGTG CCATATTAAA 143340 
GATTACCCAG AAGCCAAGCA TACCGATGAT GGTATATTTC AAGCCATTTG GCAGCTCTTG 143400 
CACATCGCCA CGATAAGATT TCCCCGAAAC AGTGACATAA GAGCGGTTAC CAATCCAAAT 143460 
G TATTGAATG ATAAAAATCG CCAACGAGAA AATTAACAAC ATCGAACCCA AAGTACTGGC 143520 
TGAAGCATAA TCAAGTTGTG AACCCGCAAT ATAGAAATAA ATTTGTGTGG CAAT C ACATC 14 3580 
GAAACTTCCA CCTAACACTA AAGGATTACT AAAGTCCGCC AATGATTGAA TGAATACAAT 143 64 0 
TAAGAAAGAA TTTG C TAATG CTGGACGCAG TAACGGAAAA ATAATCTGAT AGAACGTTTG 14 3700 
ATAACGATTA GCACGCAACG TATAAGACGC TTCTTCAATA GACGGGTGTA CGGATTTTAA 143 760 
TGCGCCATCT AAAATCATAA AAGAAATAGG AGCAAATGCG AGAATTTGAG CAATCGCAAT 14 3 820 
CCCATTAAAA CCATAAAGCC AATTATGACT AGAAAAACCA AAATTCGTAC TTAAAAATTC 143880 
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GGTTACATAA CCAGAACGTC CGAGCATTAA AGTAACCCCA AGTCCTACAA CAAATGGGGG 14 3 94 0 

CGTCACAATT GGCAAAATAG AGAAAATTTT TCCGATAAAT GCCGTTCTAC GAGCGATACG 14 4 000 

CGTGGTATAG AGCGCAAAGG CTAAACCAAA TACAGTAGAA ACGATGCCCA CAAAGCCCGA 144 060 

TAAGAAAAGC GAATTGGTAA TTACGCGGAC AATATAGCTC TGCGTTAAAA TTCGCATCAC 144120 

TTGTTGTGGC GCAAAGGTAT CGCCGTCATA AAACATTGAA ACGAAAATCG CCAATGTTGG 144180 

ATAAACGATA AAGAAGAAAA TGAGCAAGAT AATGCAAAGC AAAGAAGCGA TGATAAATTT 144 24 0 

ATCACCTTGC ATCACTTTTA GTTTTGCCAA TGAAAGTGTC GCCAATGCGG TATATCCCAC 144 3 00 

AAGTAACACA ATCACAGAAT AGCCCATACT AATTTTGTAC ACCGTCGCAC TGATAAACAT 144 36 0 

AAACAGTACA GAAAGTG CGG TCAAAAATAG CTCTGTTTTT GCTTGGGTTT GCTGTGGCAA 144420 

TCTCAATAAC GGCAAAG CAC CATACACAAT TACAGGCAGG AACCAAGCCC AAGTTAAATT 144480 

GAAAGAAGCC CAGCC CATTG CCTCTAAATA TTATCTGAAG T ACTTT C AAA CAAACCGTAA 144 54 0 

TCAAGTGCAT TGGAAGG C AG AAGTGCAAAC GCTAACAAGG ATAAAATAAT CCAAAAATAA 144600 

GCAGAATGCA TTATGGAAAG TTTTTTTGCG TTCATACAAC CTCGCAGTAA AGAAAACCTG 144 66 0 

ACTTTCGCCT AGGGTAAGTA GGCCCTAGGA TAATTACAGT AACTCCCCTA TTGGGGAGTA 144 72 0 

AAAAGCTGTC CCAAAGGGGA ACTACTATTA CGATAATTAG CCGTGGGTAT ACTTACTCAC 144 78 0 

GGCTATGATT GTCACGAAAA CTATTTCGCC AATTTAACTT CTTGAACCCA TTTTTCAATT 144840 

AAGGCTTTGC GTTGTTCTGT TGCACCGTAT TTTTCAAAGT CATAATTGAT TAATTTGAGT 144 90 0 

TTATTTGGAT CAAATGCAGT TGGCGATTGT TCTGCGGTCG TGTTAGTTAA G ATTTG T AAA 144 960 

GAATCCCCTT GTTTCCAAGC TAATTCTTGA CCTTCTTTTG ATAAAGCCCA ATCGACGAAT 14 5 02 0 

AATTTTG CAT TATCAATATT ACGCGCACCT TTTAAGAnAC TCACGCCACC TAATTCATAG 14 5080 

CCCGTTCCTT CGCACGGCAC AACTAATTCT AATGGCGCAC CATTG CGTTT TTCTAAAGCA 14 514 0 

TAATCGTGTA AGAAACCCAC CCCAATTGTC nTTCGCCACG CGCAGAGTTA CGTGATGGCG 14 52 0 0 

TGATACCCGA TTTGGTATAT TGAGAAACAT TAGGATGTAA CTCTTTTAGG AAATCGAATG 14 5260 

CCTCTTTTTC GCCCCATAAT TGAACGAAAG TTGCCAATGC AGTGTAAGCA GTACCCGCAC 14 532 0 

TTTGAGGGTC TGCAATTTGA ACTTCACCTT TTAAGCGTGG ATCGGTTAAG TCTTTCCAGC 14 53 8 0 

ATTTTGGCAC TTCTTTAATA CCTAATTTTG CCAAACGTTC AGTATTCACA CCGAAACCTA 14 5440 

AGATCCCCAT ATAAATTGAG GAAACATAAT GGCCTTTCGT TTTCGCTGGT TCACGGAAAC 14 5 50 0 

GTTCTACAAT TTCATCAATA TGTTTGGATT TATAAGGCTC AATTAACCCT AATTCTGCCG 14 5 56 0 
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CTTGAGCTTG AGGGTCAAAA GTACCGCCGA ACCAAACATC CGCTTGAGGG TTATTTTTTT 145 S20 
CAGCTTCAAC TTTAGCAAAA GTACTGCCTG AACCATTACG AATAAAGGAT GTTTTCACAT 145680 
CATATTTTTC GCCAAATGCT TTCGTGGTGG TTTCGCACAA AATATTGGTT GCACTACAAT 145740 
ATACAACTAA ACGACCTTTA GCGTTAGCAG AACCAGAAAC AGCCAAGCCA GCAGCTAAAA 145800 
GTGCGGTAGA AACAGAAAGA GAAATTTTG T TGAATTTCAT AATACATACT CCTTATTGGA 145860 
TAAAATGTTT GAATCCTACT TGAAAAATAG AAATAATACC GTGATGAAGT TCACAGTTTT 145920 
TAAAATTCAA GTGAAAAAAT AnACGTGAAA TTCACCGCAC TTTTCTTTAT AATTGCAAGA 145980 
ATTTTCCTTA CATCGAAACA CGATTATGTC AAATCCCTCT TGTATCATCA TCGCCATCAC 146040 
AGGTGCATCT GCATCAGGCA AAAGTTCTAT CGCTTCCACT GTTCACAAAG AGCTCTGTAA 146100 
TGAACTAGGC TGCCAAGAAA TTGGTATCAT TACTGAAGAC AGTTACTATA AAGATCAAAG 146160 
TCATTTAGAA ATGACTGAGC GGGTAAAAAC GAATTACGAT CATCCAAACT CTATGGATCG 146220 
CGATTTACTT ATCCAACATT TAAAAAATCT AAAAAATGGC AGTGCAGTAG ATGTGCCTGT 146280 
TTATAGCTAT GTAGAACATA CTCGCACCAA CGAAACAACG CATTTCACAC CAAAACGAAT 146340 
CGTAATTTTA GAAGGGATTT TATTACTTAC CGATGAACGA GTACGCCAAT TAGCCGATAT 146400 
TTCTGTATTC GTAGACACAC CACTTGATAT TTGTTTCATC CGCCGITTAC AACGTGATAT 146460 
GGAAGAACGT GGTCGCTCTC TACAATCAGT GATTGATCAA TATCGTGCAA CCGTGCGTCC 146520 
AATGTTCTTA CAATTTATTG AGCCGTCTAA ACAATATGCG GATATTGTCA TTCCTCGTGG 146580 
TGGTAAAAAT CGCATTGCGA TCAATATGTT AAAAGCTCAA ATCCTTCATT TATTGAATCA 146640 
AAAATAGGAA ATCTTATGCG TCTTTGCGAT ACTGATATTG AACGCTATCT TGATGATGG C 146700 
ATTATTTCTT TAACACCACG TCCAAATAAT GACAAAATCA ACGGTGCGAC AATTGATGTT 146760 
CGCTTGGGGA ATTCTTTCCG TGTATTTCGT GAACACTCTG CACCTTTTAT TGATCTAAGC 146820 
GGCCCGAAAG AAGAAGTATC CGCCCAGCTT GAATCAGTAA TGAGCGATGA AATTATTATT 146880 
CCAGAGGGGG AAGCATTTTT CTTACATCCT GGTACTTTAG CCTTAGCAAC CACGCTTGAA 146940 
TCCGTGAAAT TGCCAGCCAA TATTATCGGT TGGTTAGATG GCCGTTCTTC TTTAGCCCGT 147000 
TTAGGCTTAA TGGTACACGT CACCGCACAC CGTATTGATC CAGGTTGGGA AGGAAAAATC 147060 
GTGCTGGAAT TTTATAATTC AGGGAAATTA CCATTAGCAT TACGCCCAAA TATGGTAATT 147120 
GGTGCGTTAA GTTTTGAAGT ATTAAGTGGC GAAssAAAAC GACCATACAG TAGCCGCAAA 14 718 0 
GATGCAAAAT ATAAAAnTCA ACAAAGTGCA GTAGCCAGTC GTATTGACGA AGACAAGGAA 14 7 24 0 
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TAATCGATGA AAAAGTTCGC ACTTATCGTA GGGATCGTCG CTTTGGCGAT CTTTTCTTTT 147300 
TTATATATTC AACTTTACCG CGTGCAAAGT GCTATTAATG mACAACTGGC ACAACAAAAC 147360 
ATTGCCGTTC AGTCAATCAA TTTATCATTA TTTTCTCCCG CACTTTCCCT TGAAAATATA 147420 
AAAACTACGC AATTTTC AG C TCAAAAAATT GAAGCTAAAT TTAGTTTTTT GCCTCTTTTG 147480 
TATGGAAACG CAGCACTTCA TTCACTGAAT ATTCAACAGC TCAAATTAAC TCAAAACACA 147540 
CAAAATCCTG CCAATGTATC GATAGAGATT TCGCCTTTTT CATTAAAACA GCTTTTATCA 147500 
AAAAAAGTAA TATTAAATGG CGAAAACCAT ATTCGCATTG AATTTAATAA AC CTATTTAT 14 7660 
GGAAAAACAA AAACTTTCCA TTTTTCTTTC CATAAAGCGA ATTTAGACTT CTCAACATCA 147720 
GAATCAACCC CACTTCAATT TGTAGATGCA AGC CTAAATA ATCAGCCTAT CGGTTATATT 147780 
GAAACTCACA CAGCCCACCA ACAAATAGTG ACTTATATAA AACCACAATG CGATAACGAT 147840 
TGTTTAGCCG TATTGAAATA TCAACAAATA GACAATCAAA GTGCGGTAAA TTTTTCGGGT 147900 
AAATATTTCC CTGTTCAACG CTTATTCGCA CTATTAAATT TGCCAGAAAT GTTAAGTGGA 14 7960 
CACGCCGATT TTGATCTTGA TTTCAGCTTT TCATCTTCCT CGCTTATTCA AGGAAAATTA 148020 
AATTTTTTAG CGCAAAATGG CGAAATTCTT GGTGTGAATT TATTGGATAT GGTAGCGCAA 148080 
TATTTCCCGA TTAATTACAA CAATGATTTG TTACAAAACA AAGAATTAAA CACTCGCTTT 148140 
GAACAATTTT ACTTACAATT ATTTCTC CAA CAAAATCAAC TCATTGCTGA AAAAATCGAA 148200 
TTAAAAACGC CCGCACTTTT GGGGCAAGGA AAAGGCATCA TTGATTTAAA TCGAATGGAA 148260 
TGCAATGTGG ATATTAATTT AAATTCTACG GATCAACGCT ATCAAAATTT GACACTTCCA 14 832 0 
ATCAATTTTT TTGGTAACTG CAATTCTCCA CAATACAAAA TTAATTTTAC GAAGAAATTC 148380 
CGCCATCAAT TAATTGATGC GATTAAAGAA AAGTTACGTT AATGTCGTTA TATTTAAAAG 148440 
CTGAAAAAAT CCAATCATGG CGCGTGCTTA TTATGGCTTG CGCAGGATTT ATTTTTAATA 14 85 00 
CTACAGAATT TGTACCCGTT GCAATGTTAA GCGATATTGC ACAAAGTTTT GATATGCAAA 148 560 
CCGCCGATAC TGGTTTAATG ATGACAGTTT ATGCTTGGAC GGTGCTAATT ATGTCTTTGC 14 862 0 
CTGCGATGCT TGCCACGGGC AATATGGAAC GTAAAAGTCT GCTTATTAAG C TTTTC ATTA 14 8680 
TATTTATTGT TGGG CAT ATT TTATCGGTCA TTGCATGGAA TTTCTGGATA TTACTTCTCG 14 8 74 0 
CTCGAATGTG TATTGCATTA GCACATTCGG TGTTTTGGTC AATTACTGCC TCACTTGTGA 14 8800 
TGCGCATCTC ACCTAAACAC AAGAAAACTC AAGCATTAGG AATGCTCGCT ATTGGCACCG 14886 0 
CACTTGCCAC AATCTTAGGC TTACCGATTG GACGAATTGT CGGACAGCTC GTCGGTTGGC 148 92 0 
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GGGTCACTTT TGGCATTATT GCGGTACTGG CATTATCCAT CATGTTTTTA ATTATTCGAT 148 980 

TATTGC CAAA TTTACCCAGT AAAAATGCTG GCTCTATTGC AAGTTTACCC TTGCTTGCTA 14 904 0 

AACGTCCATT ATTACTTTGG CTTTACGTAA CAACGGCAAT CGTCATTTCA GCACACTTTA 14 9100 

CTGCTTATAC TTATATCGAA C CATTTATGA TTGATGTAGG GCATTTAGAT CCAAATTTTG 14 916 0 

CCACGGCTGT ATTATTAGTT TTTGGTTTCT CAGGCATTGC AGCAAGCCTA TTATTTAACA 1492 2 0 

GATTATACCG ATTTGCTCCC ACAAAATTTA TTGTTGTTTC GATGAGTTTA TTGATGTTCT 14 9280 

CCCTCTTGCT ACTCCTCTTT TCGACCAAAA CAATAATAGC AATGTTC AG C CTTGTGTTTA 14 93 4 0 

TTTGGGGGAT TGGCATTTCT TGTATCGGAC TATCTCTACA AATGCGCGTA CTAAAACTTG 14 9400 

CGCCAGATGC AACCGATGTT GCCACAGCTA TTTACTCGGG CATTTTCAAT GCAGGAATTG 14 94 6 0 

GCGCAGGCGC ATTATTTGGA AACCTTGCGA CAACCTATTT AGGATTAAAT GAAATTGGTT 14 952 0 

ATACAGGCGC AG CATTAGGT TTAATCGGTT TCATCATTTT TATTACAACG CACTTAAAAT 14 95 80 

ATCGCCATAC ATTTCTTCTG CAAAACAAAT AAAAAAATAA CCGCACTTTA GTTCTCCAAA 14 964 0 

AAGAGTTTTA AAAGTGCGGT TAAATCTTG C TAAGTTTTAT TTATCTTGCG TTTAACGTTT 14 970 0 

CGCTTTCTTA ATAAATTTCA TCAAACGTTT ACGTTTACGC AATTGGTTCG GAGTGAGTTT 14 976 0 

ATTTTTGCGT CCTGCAAATG GGTTTGAGCC TTCTTGGAAA AGCAGACGAA TCGGTGAACC 14 3820 

AATAATTTTT AAGCTCTTAC GATAATAATT AGATAAATAA CGTTTATAAG AATCTGGTAG 14 98 8 0 

TTTATCCATT TGGTTACCGT GAACCACTAT AATCGGCGGA TTATAACCAC CTGGGTGAGC 14 994 0 

ATATTTTAAT TTAATGCGAC GACCGCCAAT CATCGGCGGT TGGTGCTCAT CCGTTGCCAT 15 00 00 

TTGCAAAATA CGAGTTAAAA GCGAAGTCGT CATTTTTTGA GTGGCACAAG CATAGGCTTC 15 0 060 

TTTAATCGAA TCAAAAAGAT TACCTACACC ACTTCCGTGC AAGGCTGAAA TAAAATGCAC 15012 0 

CCGAGCAAAA TCAATAAAAT CTAAACGACG ATCAAGTTCA GATTTCACAC GATCTTTCAC 15 018 0 

ATCTTGATCT AAACCATCCC ATTTATTCAC GACGATCACC AAAGAACGCC CTGCATTTAA 15 024 0 

AATAAAGCCG AG C AAAG AT A AATCCTGATC AGAAATGTTC TCTCTCGCAT CAATAGTCAG 1503 00 

CAATACAACA TTAGCATCTT GAATCGCTTG TAAGGTTTTA ATCACAGAGA ATTTTTCCAC 15 0 36 0 

TGCCAAATGC ACTTTACCAC GCTTACGCAC ACCAGCCGTA TCAATCAAGG TATATTGCTG 15 0420 

CCCATCACGC TCCATTGGAA TATAAATACT GTCACGTGTC GTACCTGGCA TATCAAAAAC 150480 

CACGACACGA TCTTCGCCTA AAATACGATT GGTTAAAGTA GATTTACCCA CATTTGGACG 15 054 0 

ACCAACAATG GCAATCTTAA TATTTTTATC TTGTTCTTCT TCAAGTTCTT CGTCTAACGC 15 06 00 

-77.91- 

SUBSTITUTE SHEET (RULE 26) 



WO 96/33276 PCIYUS96/05320 

ATCATCAATT AATGCCGTAT CTTCTTCTGA ATTCAAAATC GAATTCTTGT TCCCATTCGT 15 0660 

CTTGTTCTTC CTCAGAAGTG CGGTCATTTT CATCGGCGTT TTCCATTTTT TC CGC AAAAG 150720 

GssGCAAGTA CTTGTTCCAT TAATTGAGTA ACACCACGGC CTTGTGAGGC TGCGATTTGC 1507 80 

TCAATTTCCC CTAAACCTAA CTGATAAAAT TCAGCACAAT GGGAATCCGC ATCAATACCA 15 084 0 

n CAGTTTT AT TTGCCACCAC CACAGTAATT TTGTTTTGAC GTTGGCGTAA ATAATTAGCA 150900 

ATACCAATAT CAGCTGCCGT TAAACCCGCT CTAGCATCCA CAAGGAAAAG AACAATATCC 15 096 0 

GCTTCATCAA TAGCAAGCAA AGATTGCTCC GCCATTTTTT CTTCTACACC CTCTTCCGTT 15102 0 

C CAT C AAT AC CGCCAGTATC AATAACAATA AAATCATAGC C AG CAATATG TGCGTGACCA 15108 0 

TATTTTCTAT CACGAGTTAA ACCGGGAAAA TCAGCGACTA ACGCATCTCT CGTACGAGTA 15114 0 

AGGCGATTAA ATAATGTGGA TTTTCCCACA TTTGGGCGAC CAACAAGGGC GACGACTGGA 1512 00 

GTTGCCATAA AAAAT CTCTT ATTTTGTCAA AAATGGCGAC TATTATATCG GAAGTTAAAG 151260 

GATTGGTAAA AAATCATTTC ATAAAGAAAA GAAAAAGCCA CAGGATACAA TCCTATGGCT 15132 0 

TTCTAAATAC TGTTTATTTA TCTACAACTT AACGTTATAA TAATCTCGCT TTCAAGGATT 1513 8 0 

CAGCTTGAAA GCGAGTGTTG CGCTTTTTCG CAACGTTTGA AGAGGATAAG AAAATGATTA 15144 0 

AGATCATTAT CCTTGTCGTA CTACTATTAT TAGTGAGCCA GCCCGCTTAC TAATAACACG 1515 0 0 

ACATACTAGG GAGTAACCGC TCCCTAGTTC TTCAAATATT AAAAGTTTAC TTTATTTTTG 15156 0 

TCAAGATGAA TAAATTTCAG ATACAAAAAA AGCACCCTTA GGTGCTATCT TTATTTATCT 15162 0 

TTATTTATCT TAATTGGCGG AATGGACGGG ACTCGAACCC GCGACCCCCT GCGTG ACAGG 1516 8 0 

CAGGTATTCT AACCAGCTGA ACTACCACTC CG AG AT AAG A TATTAATTTT AAATTGGCGG 15174 0 

AATGGACGGG ACTCGAACCC GCGACCCCCT G CGTG AC AG G CAGGTATTCT AACCAGCTGA 1518 0 0 

ACTACCACTC CGCTAAGTGG TGCGTATCTT AATGATGAAC ACAAATCTCG TCAATCTTTT 1518 6 0 

TTTATTAAAT TTTACTTACC TGCTTAAATT CCACACGATT AATG CACATT ATTATTGCCA 15192 0 

AAACGTTTAT CCCAAAGGCA ATTATTTCCA CTTTTTTTAT TCATCATCTT TAAGAATTCT 151980 

AAATGCGCTT GTAGCTCATC ATTAGTAGGT TGAAGTAATT TCAAATTGTG CGAAAAATCT 152 04 0 

ACCGCACTTT TTATTTCTTC CGCCGTTTTT TCTTGCATAA CACGGATAAC TCCAGACTCA 152100 

ACAGATTCTT CTTCATCGAA TAAATTTGTT TGCCCTCCCG TCATCATCAA ATAAACATCC 15216 0 

GCCAAGATCT CGGCATCCAG TAACGCACCG TGCAAAGTAC GTTTGCTATT ATCAATACCT 1522 2 0 

AAACGATCAC AAAGTGCATC CAAATTATTG CGTTTTCCAG GATACATCTG ACGCGCCATT 1522 8 0 
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TGTAGCGTAT CTGTCACCAA ACAAATATCG TCAGTTTTCA CATTGAGGTT AAGTTTACGA 152340 
AATTCATAAT CCATAAATCC AACGTCAAAG GGGGCATTGT GAATAAGCAA TTCTGCGCCA 152400 
TTAATATAAT CCAAAAAATC TTGGGCAACT TCTTTGAATT CAGGTTTATC CGCTAACATT 152460 
TCATCAGTAA TTCCATGAAC TTTAATCGCA TCAGGATCAn CGGGGCGATC TGGCTTAATA 152520 
TAAATATGAn AATTATTACC TGTGTAacGA CGATTTATTA ATTCAACTGC ACCAATtTCA 152580 
ATAATGCAGT GTCCTTCATA ATGCGCGCCA AGCTGATTCA TACCTGTAGT TTCAGTATCT 152640 
AATACAATTT GGCGATTGGG ATTAATCATT TGTTCAACCT ATCTCTTTCC ATTAAAATAC 152700 
TTGCCATTCT ACACAACAAC CTTTTTGTTA TGCCnAAACA GATTGAAATT TTTACTGATG 152760 
GATCTTGCTT AGGTAATCCA GGGGCGGGCG GAATTGGTGC CGTATTGCGT TATAAACAAC 152820 
ATGAAAAAAC ACTCTCCAAA GGCTATTTCC AAACCACCAA TAATCGAATG GAATTACGCG 15288 0 
CTGTCATTGA AGCATTAAAT ACATTAAAAG AACCTTGCTT GATCACGCTT TATAGTGATA 152 94 0 
GCCAATATAT GAAAAATGGC ATAaCcAAAT GGATCTTTAA CTGGAAAAAA AATAATTGGA 153000 
AAGCAAGTTC TGGAAAGCCT GTAAAAAACC AAGATTTATG GATAGCCTTA GATGAATCCA 153060 
TCCAACgTCA TAAAATTAAT TGGCAATGGG TAAAAGGCCA TGCTGGACAC AGAGAAAATG 153120 
AAATTTGCGA TGAATTAGCA AAAAAAGGGG CAGAAAATCC GACATTGGAA GATATGGGGT 153180 
ACATAGnAGA ATAATACAAC TGATATAACG TCATATTTTT CGATACCTAA AAATATTTAA 153240 
TACTTAAACC TAAAACAGAA TAAAAAATAA TCAAATTCAT TTaAAAAATG TGATCTCGAT 153300 
CAGATTTCAA GnAAATTAAA ATTTTGGAGT ATTGACATCA AAAATTTTTT TTGTAAAGAT 15336 0 
GCAGCTCGTC CGTTTTGGCG ATTGGACAAT TCTATTGGAG AAAAGTTCAA TCATAGATAG 15 3420 
TAAACAACCA TAAGGnATAC AAATTATGAA AAAAACACTT GCAGCATTAA TCGTTGGTGC 15 3480 
ATTCGCAGCT TCAGCAGCAA ACGCAGCTGT TGTTTATAAC AACGAAGGGA CTAACGTAGA 153540 
ATTAGGTGGT CGTTTAAGCA TTATCGCAGA ACAAAGTAAT AGCACTGTAG ATAATCAAAA 153600 
ACAGCAACAC GGTGCATTAC GCAATCAAGG TTCACGTTTC CACATTAAAG CAACTCATAA 15 3 660 
CTTCGGTGAT GGTTTCTATG CACAAGGTTA TTTAGAAACT CGTTTTGTTA CAAAAGCCTC 15 3720 
TGAAAACGGT T C AG ATAACT TCGGTGATAT TACAAGCAAA TATGCTTATG TTACTTTAGG 15 3 780 
AAATAAAGCA TTCGGTGAAG TAAAACTTGG TCGTGCGAAA ACTATTGCTG ATGG CATAAC 153840 
AAGCGCAGAA GATAAAGAAT ATGGCGTTCT CAACAATAGT GACTATATTC CTACTAGTGG 153900 
TAATACCGTT GGCTATACTT TTAAAGGTAT TGATGGTTTA GTATTAGGCG CTAATTATTT 153 960 
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ATTAGCACAA 


AAGCGTGAGG 


GTGCAAAAAA 


TACAAATAAG 


CAGCCTAATG 


AT AAGG CTGG 


154020 


TGAAGTACGT 


ATAGGTGAAA 


TCAATAATGG 


AATTCAAGTT 


GGTG CAAAAT 


ATGATGCAAA 


154080 


CGACATCGTT 


G C AAAAATTG 


CTTATGGTAG 


AACTAACTAC 


AAATATAACG 


AAGCTGACGA 


154140 


GCATACACAG 


CAATTAAATG 


GTGTATTAGC 


AACTTTAGGC 


TATCGTTTTA 


GTGATTTAGG 


154200 


CTTATTAGTG 


TCTCTAGATA 


GTGGCTATGC 


AAAAACTAAA 


AACTATAAAA 


CTAAACACGA 


154260 


AAAACGCTAT 


TTCGTATCTC 


CAGGTTTCCA 


ATATGAATTA 


ATGGAAGATA 


CTAATGTCTA 


154320 


TGGCAACTTC 


AAATATG An C 


GCACTTCTGT 


AGATCAAGGT 


GAAAAAACAC 


GTGAACAAGC 


154380 


AGTATTATTC 


GGTGTAGATC 


ATAAACTTCA 


CAAACAACTA 


TTAACCTATA 


TTGAaGGTGC 


154440 


TTACGCTAGA 


nCTAAAACAA 


CTAATGGAGG 


CGTAAAAACT 


GAAAAAGAAA 


AATCAGTGGG 


154500 


TGTAGGTTTA 


CGCGTTTACT 


TCTAATCATT 


TGTTAGAAAT 


ACATTATTAA 


AAGCAAGGCG 


154560 


AATCGAAAGA 


TTCGCTtTTT 


TTTTTG CTC A 


AAATCAAGTT 


AAAAAATGAT 


TAAGTTAAAA 


154620 


GTGTATAAAT 


ATTTAGGCTA 


TTTTATAATT 


AATAAAAAAt 


CTGTGATATA 


TATCACAGAT 


154680 


TTTTAAATCA 


ATTAACTATT 


TAAGTGTTTA 


CTATTAATTC 


TCTTTCCACT 


TTCCGTTTAC 


154740 


TACTGTGCCG 


ATTACTTGGT 


AATTTGGCGT 


AAACACGGCT 


AAGTTTGCTA 


TCTTACCTTT 


154800 


TTCTACCGAA 


CCTAAACGAT 


CATCTATACC 


AATTGCTCGT 


GCAGGATAAA 


GGTTACTCAT 


154860 


TCTAATGGCT 


TCCGCAAGAG 


GAATTTCGAC 


ATATTCTACC 


GCATTTTTAA 


TGGATTCCAT 


154920 


CATTGTGATT 


GATGCCCCTG 


CAATTGTGTC 


ATTCGCATCA 


TAGCAACGCC 


CTTCTTTGAT 


154980 


AT AAATTG TT 


TTTCCCACAA 


AGGTAAAACT 


TTCTAGTTCT 


GGCGGTGCGC 


CTGCAGCTGC 


155040 


AATAGAGTCA 


GTCACAATAC 


AAAGTTTGTC 


GCCTTTGATT 


TTTTTATCAA 


TGCGAACATT 


155100 


GCCGTAATTG 


AT ATG C AC AC 


CATCGACGAT 


AATGCCAGTG 


TAAACATCAG 


AATCTAATAC 


155160 


TGCACCCACT 


ACGCCCATTT 


CACGTCCAGA 


ACTAATTGGC 


GACATCGCAT 


TATGCAAGTG 


155220 


AGTCGCAAAA 


GTTGCGCCTT 


TGTGGAATGC 


GGCTTTAGCC 


ACTTCATAAG 


TGGCATTAGA 


155280 


ATGGCCTACA 


GAGACAATAA 


TGCCCGCTTT 


CACAAAATCA 


GGCGTATAGT 


TAATGGTTGG 


155340 


GTTTTCTGCG 


GCAATGGTCA 


TT TTTGTAAT 


CACATCACCA 


TTTTCGCAGA 


GAAAATCTTT 


155400 


CATTTCAGGC 


GTAATTTCAC 


GAATATATTC 


GGGACGGTGT 


ACGCCTTTTT 


TCTCAATGCT 


155460 


GAGATAAGGT 


CCTTCAATAT 


GTAAACCAAG 


TGCTTGGTTT 


TTATGCTTGT 


TTAAAT ATT C 


155520 


ACGCATAATT 


TTTACCGCAC 


TTTTTATATT 


TTCATCAGGC 


GCGGTAATAA 


AAGTGGGAAG 


155580 


GAAACTTGTA 


CAACCTGATT 


TTAAGTTAGT 


TTCCTGCATA 


ATTTCCAAGG 


TTTCAACGCT 


155640 
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AGTTTGGTCG TTAAACATCA CGCCACCACA GCCATTTAAT TGCAGATCAA TAAAACCAGC 155 700 

CGTTAAATTA TTACCTTGTA AATCAATGGT TTTGATACCT GTTTCTAATT CAGCTTGGGG 15 5 760 

GATCACTGCT TCAATAATTT CACCGTTAAT AATTACAGCA AAATCACGTA AAAC AT CAT A 15 3 6 20 

TTTTGTGTAA ATTACACAAT TAATTAATGC GTATTTCATA TTCTGTTCCT AATATCAACA 155 880 

TCAGTTTAAC ATAGCAACTA TATTGCTCTT AAACGTAAGG AC AATTTG C C CTATGATTGT 15 5 940 

TTGTGATTAT AGGGCGAATG TAATTCGCCC CTACATAAAG ATGAAGAATA GATCTCTATT 156000 

TATCCTGATA ATCCGTACCA GCAACCGCGC CTTCTAATTC AG T AAAAT AT TTCACGGTTT 156 06 0 

TCACTTTTAG CTCTTGTTGA GATTATAGGG CGAATGTAAT TCGCCCCTAC ATAAAGATGA 15612 0 

AGAATAGATC TCTATTTATC CTGATAATCC GTACCAGCAA CTGCGCCTTC TAATTCAGTA 156180 

AAATATTTCA CGGTTTTCAC TTTTAACTCT TGTTGAGCAG CTTCATCACA AACAAGCAAG 15624 0 

AAATGACGGT GCATTTGTAA TGCACTGACT GTC CAT AAAT GGTTAATACT GCCTTCAACT 156300 

GCAGCTTGCA CTGCCAGTGC TTTTTGATGT CCTGTTGCTA AAATCATCAC TTCTTCTGCA 156360 

TCTAATAATG TGCCAACACC AATGGTTAAA GCATATTTCG GCACTTGAGT AACATCATTA 15642 0 

TTAAAGAAGC GTG AATTTG C AATCAATGTA TCTTGCGTTA GCGTTTTAAT ACGGGTGCGA 156480 

GAACTTAATG AAGAAG CAGG TTCATTAAAT GCGATATGTC CATCTACGCC AACCCCCCCC 156 54 0 

ATAAACAGAT GAATTTTGCC GTAGGATTTA ATTTTTTCTT CATAACGACG GCATTCTGCA 156600 

TTATGGTCAT CGGTATTGCC GTTAAGAATA TTGATATTTT CAGGTAAAAT ATCGATATGA 15666 0 

TTGAAAAAAT TGTTATACAT AAAGCTGTGG TAACTTTCAG GGTGTTCCTC TGGTAAAGCC 156 72 0 

ACATATTCAT CCATATTGAA AGTCACTACG TGTTTGAAAC TCACTTTGCC TGCTTGATAA 156 78 0 

AGTCGAATCA GTTCTTGATA AGTTTTTAGT GGTGTGCCAC CAGTAGGTAA CCCCAATACA 156840 

AAAGGACGCT CGGCGGTAGG TTTAAAATCG TTGATACGAT TGATGATATG TTGTGCAGCC 156 90 0 

CAACAACTGA CTTGTTGTTC TGTTTGTAAT GGAATAAAAC GCATATTAGC CTTCCTTTAT 156 96 0 

TATTGACCGT GATGATCCTA TTCATATCGT AGGCTGGACT TTAGCCCACC CTACACCAAA 15 702 0 

GGATCATTAT AAGTGCGGCA ATTTTCTATC TTGTTTTTAA AAAGAAATTC TTTTGCTCCA 15 708 0 

ATAATTTACG AC AG AG CAAT AGAAGTTGCC CCCTTCCGCC CTTCGGGCAC CTTCCCCCGT 15 714 0 

AAACAGGGGA AGGAAAGTG C TTGCTGACTT GCTGTTCTGT TTGTAACGGA ATAAAACAGG 15 7200 

GGGAGGTAAG TTTTTTCTAA TTTAAATATT TATGACAAAA ATTTCGCTTT CAAGTCTTTT 15 7260 

GCTTTTGCAA GCTGTTCTTC TGTTGCTTTT GATGTCATTG GTTCACGGCA ATAACCTGCA 1573 2 0 
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TCCACCCCTT CTAATTTAAG CAATTCTTTG ATGGTTAAAT ATAAACCGTT CGCTAAAATG 15 7380 
CCTTCGATTA AATCGTTGGT TACGTGTTGG ATTTCTAACG CTTCGGCTAA TTTACCCGCT 157440 

TTGGTTAATT CAAAAATTTG ACGAGCACGA ACGCCGTTTA CGTTAAAGGT TGAACCAATT 15 7 500 

GCACCATCTA CCCCAAGTGA TGCTGCAGGT AACATCATTT CATCGAAACC TGCCCAAATT 15 7560 

AAGTGGTTTG GATAAGCTTT TTTCAAACGT TCTAATAAAT AGAAATCACC TGCTGTGAAT 15 7620 

TTTACCCCTA ATACTTTCGG GTTTTTATAA AGTTCACCGA ATTGTTCAAT GCCCATATTC 1576 80 

ACGCCAGTTA AGAACGGGAT AGAGTACACA ATCATATTGT TGCCTGTTTC AGCGATGATG 15 7740 

GTGTCGTAAT AATGTTTAAT TTCAGGGAAG CTAAATTTGT AATAGAATGG CGTCACTGCA 1578 00 

GATAAGCAGT CGTAGCCTAA TTCTGTTGCA TATTTACCTA ATTCCACCGC TTCTTTTAAG 157860 

TTTACACTAC CCACTTGAGC AATTAGTGCG ATTTGATCTT TTGCTTCATC TTTTGCAATG 15 792 0 

CGGAAGATTT CTTTTTTCTC TTCCGTTGAA AGCATAAAGT TTTCGCCTGT TGAACCGCCC 15 7980 

ACATATAAGC CATCAACTTT CATTTTGTCG ATATTATGGC GGATAATTTG GCGTAAACCT 15 8 040 

TTTTCATTAA TGGTGC CATC TTCATTAAAT GACACTAATA ATGCACTGAA AATACCTTTT 158100 

AAATCACGCA TAAGTTTTCT CCTATTTTAT ACGTTGTTCA AGAATTACAT TGAGAGTTTT 158160 

TTGTTTAGTT TG AG CGGCAA TATCTTCTTC GCCCTGCACT AACAAAGCAT AAATTAAATC 158 22 0 

TAATACAAAA AGCTGTGCAA TTTTGGTGCC AATAGAATCG CCTTGTAACT TGCCTTGTTT 158280 

ATTACCATTT ACCAAAACAT AATCAGCATA TTCTGTCACG GGGGAACGCA AACTATG CGT 158 34 0 

CAATGCCACT GTTGTTGCAC CATTTTGTTT TGCGATTTTG ATGGTATGCG CCGTTTCAGC 158400 

AGAATAACCA GAATGGCTTA AACCAATTGC CACATCAGAA GAGGTCAATA ATGCCGCCTG 15846 0 

CATATACATA AAATGATTAT TACCTGAAGC ATCCACTTGA AAACCAATAC GCATCAGCTT 158 52 0 

ATTTTTAGCA TCTTCAGCCG TTACGCCAGA TGAACCCACC CCAAATAAAA AAATACGGCG 158 58 0 

TGCTTTTTTA AGTACTTTAA CGACTTCTTC AAGCTGTTTT AAATCAAGTA AATTAATCGT 15864 0 

TTCTTCCATT ACATTAGCAA CCGCCGTTTG CAATTTTTGT GCAATAGTGA GCGAATCATC 158 700 

ACTTGGCATA ATTTCTGTTT CAAGTATGCT TTCATCTTGA TTATCTTTCG TCGCAAGTTC 158 76 0 

AATAGAAAGC TCTAGTTTGA ACTCGCTGAA AC CTTTAAAA CCAATGGTTC GACAAAAACG 158820 

GACGAGCGTA GCTTCTCCAA CTTG CAAATG TTTAGCGATT TCTGAAAGGG AGCATTGAGA 15888 0 

GACTAAATCA GGCGAACGCA AAATCGTATC GGCAATTTTT TTCTCTGATT TAGTTAAACT 158 94 0 

CTGGTAAAGT GAGCCAATTT TGTTTAACAC ATTTCCCGAT TTAGCCATAA ATTGTTCCTT 15 9000 
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CAGGCTTATC CAGTAAAACA TCTTTTACCC AATAAGCTGC GCCAATTAAA CCTGCATCTT 15 906 0 

GTCCAAATTT AGCGGTTTCA ATTTCACAAC AATATATCGA AGGAAAATCT TGTAAATATT 15 912 0 

TTTCAACCAA AGATAAATAT CCCTCAGCTA ATCCCACACT GCCACCAATC GCGATTTTTT 15 918 0 

GAATATCTAA ACTAATCACT AAATC AG CAA TTAAATT AG C AATGGCTTTC G C AG AACGCT 15 924 0 

CGACAAGTGC GGTCGCTTTT TCATCATTTT TTCTAAAACG CTCAAACACT TCTTTTGGAT 15 93 00 

CGCACGGTTC TTCCCATTGG GAAGAAACTG CTTCAATTGC TCGCCCTGAA GCAATCGCTT 15 9360 

CGACACAACC TCGACGCCCA CAACCACAGA TCGCACCATT TGGATCAGCT AATGTATGAC 15 9420 

CAATATGCCC AGCAATACCA CGAGAACCTG TCTGCAAAAT TTGGTTCAG C ACAATCCCTC 15 9480 

CGCCAACTCC TGTTGAAACG GTAATAAAAA CAAAATTTGA TACCTGTTCA GAATTTTGCA 15 9540 

ATTGATATTC CGCATAAGTC GCAGCTTGTG CAT C ATTT AA TAAACCAATC GGTTTATCCG 15 9600 

TATGTTTTGC AATACTCGCT TTAAGTGGAA ATTCCGCCAA GCCACCTAAG TTTTTCGGAT 15 966 0 

TTAATG C ACT TAAAATGCCA TTATTGATGA TCCCCGTAGA TGCTACCGCT ACATAATCAA 159720 

ACTGCCCTTC ATAATCAGCA AG CAATTTG C CTAACGCTTG GTGCATTCCC TCCACGACAT 15 978 0 

TTTCACGTGG TGTATGAATT TGCTGACGTT GCTCAATTTC GCCATTTTTT ACAATCGCCG 15 984 0 

CTGCAATTTT TGTCCCACCA ATATCTAGTG CTAAACAACG CATAACTTAT C CTTATAGTG 15 9900 

TAAAGTCkTT nTCTCACACG TCTTACGACT ATCTTGCAGA TTTCACTGAA TTAGCAAACC 15 9960 

AACTCACAAT ATGTTCAAGA CGAGTTAAGG CTGAACCTAC AGTCACGCAA TCTGCACCAA 16 002 0 

TTTCAATTGC CACTTTTGCC AATTCTGGTG TGTTATAACG CCCCTCAGCC ATCACAAAAC 16 008 0 

AACCCGCTGA TTTTAAATCT TTCACTAATT GATAATCTGG TTCTTCTGGC ACTGCGCCAC 160140 

CTGTATAACC TGACATCGTA CTGCCGACAA TATCAAAACC AAGCGCTTTA CAATACAAGC 160200 

CTTCTTCTAA ATTAGGaACA ATCTGC CATT GCCAAACAGC CCATTTCGTG GATTTTTTTC 160260 

ACCGCACTTT CAATATCCAC AGGACGTGGA CGGCTCGTGC CATCTACTGC AATAATATCC 16 03 2 0 

GCGCCAGCAT TGGCTAGATC TTCAATATCT TGTAAAAACG GCGTAATACG CACGGGACTA 16 03 80 

TCAGGCAAAT CACGTTTTAC AATTCCGATA ATCGGGACAT TCACAAAAGG ACGGGTTGCT 16 044 0 

TTAAGGTTAT CAACACCCTC AATACGTAAG CCAGAACGTC CGCCCATAAC TGAAGC CTG A 160 500 

G C CAT AG C AG AAACGATTTC AGGTTTATCC ATTGGGCCAT CATCGACAGG TTGGCAAGaA 16 056 0 

GAAATAAGAC CATATTGAAT TTGAGAAAGC ACTTCTTGAT AAGATAATTT AG AC AT AT AA 16 0620 

CACCCCTCAT TTAAATCTGA ATAAATCACT TTAAAATTAA TTTAATCCAC ACAAAACAAT 16 0680 
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CTTTTATAAG TTGTAATATG CTTATATTTT CATCATTTAA CCTTATTATA TGAAAAATAA 16074 0 

ATGCCATTTT AAAGAATATT TTTGAAGTGT TTTTTCATAA ATGTGATCAA CTTCTCATTT 160800 

TGGAAATAAA ACTCGTAAAA CTATTTTAAA TTTGTTTTTT TACTCTTAGA ATTGCTCTCG 160860 

AATCCTATTT TGGGGGAGAA AACATACATT TTTGGCCTAA ATTGATTCTC AT ATT AC G GG 16 092 0 

GTTAAT c TAT GCCCCTTTCC AACATAAGTT CACACAGGAG CGAATATGAT GAAATTGACA 16098 0 

AAACTTTTCC TTGCCACCGC CATTTCTTTA GGCGTATCTT CTGCTGtTCT TGCCGCTGAT 16104 0 

TATGACTTGA AATTCGGTAT GAATGCTGGA ACTTCAT CAA ATGAATATAA AGCGGCAGAA 16110 0 

ATGTTTGCCA AAGAAGTCAA AGAAAAATCA CAGGGTAAAA TTGAAATTTC ACTTTATCCA 16116 0 

AGTTCACAAT TAGGTGATGA CCGTGCAATG TTAAAACAAT TAAAAGACGG TTCTCTCGAC 16122 0 

TTTACCTTTG CAGAATCTGC TCGCTTCCAG CTGTTTTACC CTGAAGCGGC AGTATTTGCC 161280 

TTACCTTATG TTATTAGCAA CTACAATGTT GCACAAAAAG CCTTATTCGA TACAGAATTC 16134 0 

GGTAAAGATT TAATTAAAAA AATGGATAAA GATCTTGGCG TGACTTTACT TTCCCAAGCT 1614 0 0 

TATAACGGAA CTCGCCAAAC GACTTCAAAT CGTGCAATCA ACAGTATTGC AGATATGAAA 161460 

GGCTTAAAAC TTCGTGTGCC AAATGCAGCA ACAAACTTAG CCTATGCTAA ATATGTTGGT 16152 0 

GCATCACCAA CACCAATGGC ATTTTCTGAA GTTTATCTTG CGTTACAAAC CAATGCCGTC 16158 0 

GATGGTCAAG AAAACCCGTT AGCAGCGGTG CAAGCACAAA AATTCTATGA AGTGCAAAAG 16164 0 

TTCTTAGCAA TGACTAATCA TATTTTGAAT GACCAACTTT ATTTAGTAAG CAACGAGACT 161700 

TATAAAGAAC TCCCTGAAGA TCTTCAAAAA GTCGTAAAAG ATGCTGCCGA AAATGCAGCA 16176 0 

AAATATCACA CTAAATTATT CGTAGATGGA GAGAAAGATT TAGTCACATT CTTTGAAAAA 161820 

CAAGGCGTGA AAATTACACA TCCTGATCTT GTTCCATTTA AAGAATCAAT GAAGCCGTAT 16188 0 

TATGCTGAGT TTGTAAAACA AACTGGTCAA AAAGGTGAAT CAGCTTTAAA ACAAATTGAA 16194 0 

GCAATCAATC CATAATCCCT CTTTAGTGTG CCGTAATGGC CCTACATTCC CTTATTCTTC 16 2 000 

ATCAAACGGA GTGAGTATGA AATATATTAA TAAGCTTGAG GAATGGCTGG GTGGCGCATT 16 2 06 0 

ATTTATCGCC ATTTTCGGTA TTCTTATCGC TCAAATTCTT TCACGCCAAG TTTTTCATTC 16 212 0 

TCCGTTAATT TGGAGTGAAG AACTCGCCAA GCTCTTATTT GTTTACGTGG GTATGTTGGG 16218 0 

TATCAGCGTT GCTGTGAGAA AACAAGAACA CGTATTTATT GATTTTTTAA CTAATCTAAT 16 22 4 0 

GCCCGAAAAA ATCAGAAAAT TCACAAATAC GTTTGTACAA TTATTAGTCT TTATATGTAT 162300 

TTTCTTATTT ATTCATTTCG GTATTCGTAC TTTTAACGGT GCATCATTCC CTATTGATGC 16236 0 
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CTTAGGAGGC ATTTCTGAAA AATGGATTTT CGCAGCACTG CCTGTTGTCG CAATATTAAT 162420 
GATGTTTCGC TTTATCCAAG CGCAAACCCT AAACTTTAAG ACTGGGAAAA G CTATTTACC 1 62480 
TGCAACTTTC TTTATCATAA GTGCGGTCAT TTTATTTGCG ATTTTATTTT TCGCGCCAGA 16 2 540 
TTGG TTCAAA GTATTGCGTA TTAG CAATTA TATAAAACTC GGTTCAAGTT CAGTCTATGT 1S2600 
CGCCTTACTC GTTTGGCTAA TCATTATGTT TATCGGTGTC CCTGTAGGTT GGTCCTTATT 1S2S60 
TATTGCTACT CTACTTTATT TTTCTATGAC ACGTTGGAAC GTCGTAAATG CCGCAACTGA 152720 
AAAATTAGTC TATAGCCTAG ACAGCTTCCC ATTACTTGCC GTGCCATTTT ATATTTTAAC 1627 S0 
GGGCATTCTA ATGAATACAG GTGGGATTAC CGAACGTATT TTTAATTTTG CTAAAGCCTT 162 840 
ACTCGGTCAT TACACAGGAG GAATGGGACA CGTTAATATC GGCGCAAGTT TATTGTTCTC 16 29 00 
TGGTATGTCA GGTTCAGCAC TTGCTGATGC GGGGGGCTTA GGTCAATTGG AAATCAAAGC 16 29 60 
AATGCGTGAT GCTGGTTATG ACGATGATAT TTGCGGAGGA ATTACTGCTG CTTCTTGTAT 16 3020 
TATTGGGCCA TTAGTTCCGC CAAGTATTGC AATGATTATT TACGGTGTAA TTGCCAATGA 16.080 
ATCTATCGCA AAACTCTTTA TTGCAGGTTT TATTCCAGGT GTATTAATTA CTTTAGCTTT l 63 i 40 
AATGGCAATG AATTATCGCA TTGCAAAAAA ACGAGGTTAT CCACGTACAC CAAAAGCTAC 163.00 
GAGAGAaCAA CTTTGCAGCA GCTTTAAACA ATCTTTTTGG GCAATCTTAA CGCCGTTATT 163260 
AATTATCGGT GGTATTTTTT CAGGCTTATT CAGTCCAACA GAATCTGCCA TTGTTGCAGC 163320 
AGCATACTCT GTAATTATTG GTAAATTCGT GTATAAAGAA TTAAC CTTAA AAAGCTTATT 163380 
TAATAGTTGC ATAGAAGCAA TGGCAATTAC GGGCGTAGTC GCCTTAATGA TTATGACCGT 16 3440 
GACTTTCTTT GGCGATATGA TTGCGCGTGA ACAAGTCGCA ATGCGTGTTG CTGATGTGTT 163500 
TGTTGCCGTT GCCGATTCGC CTTTAACCGT ATTGATAATG ATTAACGCAC TGTTACTTTT 16 3 56 0 
TCTTGGAATG TTCATTGATG CCCTAGCATT ACAATTTTTA GTATTACCAA TGCTTATTCC 1636 2 0 
TATCGCAATG CAGTTCAATA TTGACTTAAT CTTCTTTGGT GTAATGACCA CATTAAATAT 163 680 
GATGGTTGGT ATTCTTACCC CACCAATGGG AATGGCTCTC TTTGTTGTTG CTCGTGTAGG 1 63740 
AAATATGTCA GTTTCCACGG TAACCAAAGG CGTATTACCG TTCTTGATTC CCGTTTTCGT 16 3 800 
CACATTAGTA TTAATCACGA TTTTCCCACA AATCATCACA TTTGTGCCAA ATCTATTGAT 163860 
ACCATAATAA AAAGTGCGGT TGAAAAACAT TAAATTTTTT AACCGCACTT TCATCGTTTT 16 3920 
CGTAAGTTAC ATCTGCCGTA T CTTACAATA CAAAAGGGTG CAATAACTGC ACTTTATCAA 163 980 
CGCCTATTCA ATGCCGTTTA ATGAATAGAA AACTGTAAAC ACAACT CAAA ATGGGGTTTA 16 4 040 
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TTATGAAACT AACTAAAACA GCATTATGTA CTGCACTTTT TGCCACCTTT ACTTTTTCAG 16 4100 

CGAACGCACA AACTTATCCT GATTTGCCAG TGGGGATCAA AGGAGGAACA GGCGCATTAA 16416 0 

TTGGCGACAC AGTTTATGTG GGATTAGGTT CTGGTGGCGA TAAATTCTAT ACCTTAGACC 16422 0 

TAAAAGATCC TTCAGCACAA TGGAAAGAAA TTGCTACATT TCCGGGTGGC GAACGCAATC 16 42 80 

AACCTGTTGC AGCCGCAGTG GATGGAAAAC TTTATGTATT CGGTGGTTTA CAAAAAAATG 16434 0 

AAAAAGGCGA ACTTCAACTC GTTAATGACG CTTATCGCTA TAACCCAAGC GATAATACTT 164400 

GGATGAAACT TCCTACTCGT TCTCCCCGTG GTTTAGTAGG ATCAAGTGGT GCATCTCACG 16446 0 

GAGATAAAGT TTATATTTTA GGCGGCTCTA ATCTCTCTAT ATTTAATGGC TTCTTCCAAG 164 52 0 

ACACTGTGGC AGCAGGTGAA GATAAAGCGA AAAAAGATGA AATCGCAGCA GCTTATTTCG 164580 

ATCAACGTCC AGAAGATTAT TTCTTTACAA CAGAATTATT GAGCTATGAA CCATCAACCA 164 64 0 

ATAAATGGCG CAATGAAGGT CGTATTCCAT TCTCTGGTCG TGCTGGTGCA GCCTTTACAA 164 7 00 

TTCAAGGTAA TGAGCTAGTG GTTGTCAATG GCGAAATTAA ACCTGGACTT CGCACCGCTG 164 76 0 

AAACCCATCA AGGTAAATTC ACTGCTAAAG GTGTGCAATG GAAAAACTTA CCTGACTTAC 1648 2 0 

CTGCTCCAAA AGGCAAATCA CAAGATGGTT TAGCTGGTGC GCTTTCAGGC TATAGTAACG 16488 0 

GTCATTATTT AGTCACTGGC GGCGCAAATT TTCCAGGTTC AATCAAACAA TTCAAAGAAG 164 94 0 

GAAAACTTCA CG CAC AT AAA GGTTTAAGCA AAGCTTGGCA TAACGAAGTT TAT AC G TTG A 16 5 00 0 

ATAATGGTAA ATGGCGCATT GTTGGGGAAT TACCAATGAA TATTGGTTAT GGTTTTTCTG 16 5 06 0 

TATCTTACAA CAATAAAGTT TTACTGATTG GCGGTGAAAC TGATGGAGGT AAGG CTTTAA 16512 0 

CTAGCGTCAA AG C AATAAGC TATGACGGTA AAAAATTAAC C AT CGAATAA TTTATCACAG 16 518 0 

AAT AGTG CAT AAAATTTATA ATAATCAAAA GCGGCGAAAT TTATCGCCGC TTT ATG C AAA 16 524 0 

ATAAAAGTAA TTAAAGCAAA ATTAAGACAA ATTTTGACCG CACTTTAAAT GCGCTATAAA 1653 00 

GAGCTGGCAT TTTTATTAAT AAGATTTAGC GATGTTATTT ATTCCACCAC CACTACTCTG 16 53 6 0 

TTTATTCATT GCCATAGCAA TGTATTTTTT G C CAAAGATT GCTAGTTATT CTGTCCATTT 16 54 2 0 

TTCAGTGATT GTTTTTGTTA TTTCCCTTTC ATTTTTGATT GCTTTAAGCA GCGTTATGCA 16 54 8 0 

ATCTTTATAT GTAAAACCTC CATTAATCCT CGTGACTTTA AAGGCACAAC AAAATTAGTT 16 5 54 0 

TCCACAGGCA TATTTCGATT TAGCCGTAAC CCAATGTATT TAAGTTTGCT GTTAATTTTG 16 56 00 

CTTGCTTGGA CGCTTTGGTT AGGTAATAGT TTGGCTTGGT TAGGCGTAAT TATCTTTATA 16 5660 

TTAGTGATAA ACCAGTTTCA AATAGCCCGA GAAGAAACCT ATTTGGAAAG TAAATTTGGC 16 572 0 
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GATGAATATC GTCGTTACAA ACAAAAAGTA AGACGTTGGC TATAAAAAAA CACCTTCACA 165780 
AGGAAGGTGT TTTTATCAAA TTAAACTCTA TTTTTTCGGT GCCTGCATAA AGCGGAAGAA 165840 
ATCACTATCT GGTTTTAAAA TCATTATATT GTCAGAATTT GCAAAACTCG CTTCATAGGC 165900 
TTTTAAGCTA CGCACAAAAG TAAAGAATTG TGGTTCTTGA GCAAAAGCAT CTGAATATAA 165 960 
TTTTGCTGCA GCTGCATCGC CTGAACCTCG TAATTCTTGT GCAGTTTTAT TTGCATTAGC 166020 
AAGAATTAAC GTCACTTTGC GATCTACATC AGCTTGAATA AATGCTGCTT TTTCTTTCCC 166080 
TTGAGAACGA TGTTCACGCG CCACTGCATC ACGTTCTGCA CGCATACGTT GGTAAATTGA 1663 40 
AGAAGAGACT TCATCTGGTA AATTAATTTG TTTTACACGC ACATCGATCA CTTCAATACC 166200 
TAATTCTGCC GTACTGTCTT GTCCTGAACT TAAAGCTTTT TTTGCCCCTT CCATTAATTC 16626 0 
TCCACGTGTA CCAGAAACAA TATCTTTAAT CGTGCGTGAA CCAATTTCAG AACGTAAACG 166320 
GTCGTTTACT TTACGACTTA AT AAATTTG C AGCTTGAGCA TAATCGCCAC CGCCAGTAGA 166380 
TGTGTAGAAT CGACCAAAAT CACTGATTTT CCATTTCACA TAAGAATCCA CTAGCAAATC 166440 
TTTTTTCTCA ACAGTAACAA AACGTGTTGC AGAGCCATCT AAAGTACGAA TACGTGCATC 166500 
TAAAACTTTG ATGCTATCAA TTAAAGG CAC TTTAAAATGC AAGCCCGGTT CATATACAAC 166560 
CACTTTGTTA TCAGCATCAC GCTGTACTTT GTTAAATCGT AACATAATGC CACGCGTACC 166620 
TTCCGTTACA ACGACGATAC TAGAATAAAC GACTGCCGCA ATCACGAAAA TAACAGGTAA 166680 
TAAAAATCTA CGCATTAGTT AAATCTCCCT TGACGAATCG GCGCAACGGT AGTCGGCTGA 166740 
GGCTGATAAG AATTTTGACG TTCTGGTGCA GTAAAAGCAG GCGAAGAATT GACCGCACTT 16 6 8 00 
GGCGCACTTG TTACCGATTT TTTGCCCATA ATTTGCTCTA GCGGTAACAC AGTTAAATTA 166860 
TTACCATTAT TGCCATCAAG CATTACTTTT GGGGTATTTG CCATCACTTT TTCCATCGTC 16692 0 
TGAATATATA AACGCTCACG CAGTAAATCT GGCGCAGCCT TAAATTCTGG CAACAAGCGT 166 98 0 
TGCAAACGTT CCACTTCACC TTTTGCATCT AACACGATAC GATCTTTATA AGCTGTTGCT 167040 
TCTTCAAGAA TACGTTGCGC ATCACCACGT GCAATCGGTT CTTTCTCACG CGCATAGGCT 167100 
TCCGCTTCAC GAATAAAACG TTGTTCATCT TCCTGCGCTT TAATTGCATC ATCAAACGCA 167160 
TCTTTTACTT CTTCAGGTGG GCGCGCAGAT TGGAAGTTCA CATCAATAAC CTCAAGTCCC 167220 
ATATCATAGG ATTTGATAAT TTCATTCAAA GCTTTCCAAG TATTTTCACG CACGACAGAG 16 7280 
CGACCTGTGG TTAAAATATC ATT CAT AG AC ATATG CCCAA TAACATAACG TAATGCACTA 167340 
TCTGTCGCTT GATTCAAACT ATCATCTGCA TTTGTTACGC TAAAAAGATA TTTTGCAGGA 16 7400 

-77.101- 

SUBSTITUTE 5HEET (RULE 26) 



BNSDOCID- <WO_9633276A1J^> 



WO 96/33276 PCT/US96/05320 

TCTTGCACAC GATATTGCAC GGTCATTTCC ACTTTTACCA TATTTTCATC TTGAGTCAGC 16 74 60 

ATTGCACCTT GTGTACGCAA CTCTTTTACT TGCTCCACAT TTACTGGCAA TACTTTATCT 16 7 52 0 

ACAAATGTTG GTTTCCAGTT CAAACCAGGT TGAACAATAG AATGTAACTC GCCAAAACGA 16 758 0 

AGCACCACAC CGCGTTCAGC CTCTTTAATG GTGTAAAAAC CGTTTACGCC CCAAATAATC 167640 

GCACCAATCG CTACAGCAAG TGGAATAACT TTACCAAAAT GAAATGGCGT ATTGCCTTGA 16 77 00 

GAACTGCCAT TATTTTG AC C GCTCTTTTTA TTTCCACCAC CTAATTTTTT AAG C AG ATT A 16 776 0 

TTAAAAATCT C CTCAAT AT C TGGTGGAGAT TGCTCTTGAT TTCCACGATT TTGATTGTTA 16 7 82 0 

TTCC ATC CAT TATTATTGGA TGAATTTCCC GGCTGTTGGT CGTTGCTTTG CCCTGGTTTA 16 7880 

CTCCAAGGAT CGCGATCTGA ACCGTTCTGT G AC ATTT CAT TCTCCATTTG TCAAAAAATT 16 7 94 0 

GTTAAGAGTT TAGCTAAATA AATGGGGCAC GTCTATGAAA ATACAAGGCT TTTTAGGCAA 16 800 0 

TAAAAAATCC TTGATTTTAT ATAAGGATTT TTTTGATTAA TTCACTTCAT AAAGAACATA 16 806 0 

TTTTTCAAGT TCTTTTATCT GTAAACTTTC TCT ATTC CAT TCATAAAAGT GCGGTAATTT 16 812 0 

TTCTTGATGA TTGACAAAAG CGGCTAAATA AATAGGGAGT TCATCAGTAA AC C AC AACTG 16818 0 

CCCCTGCGGA CAATAATCTG AAAAAATTTT TTGTTGTTCA GGAAAATGAC GAACATTGGA 16 824 0 

TAATTTTACG ATCCCAAATC CTTG AG ATT T CAATACAGCC TCTCGCAAAC ACCAACAACG 1683 00 

ATAAAAAGCG TTCAAAGAAT CCTGCTGATG ATGAAACCAA TCAATTTCTT CTTTTGGTGC 16 836 0 

AATATGTTCC ATCAACGCCG TAAAATTTCT TATTTTAGGA AATTCAATAT CAATTCCCAC 168420 

CGCACTTTTT TCTTCATTTC TAATATCTAA TATTACCGCC ACCCAATCAC CAGAATGGCT 16 8480 

AATATTAAAA TCTATTCGCT CATCAAGAAA ATAAGGTCTG CCACTTTCAG TACGATGAAT 16 8 54 0 

TTGAGATAAA AGTGCGGTGG ATTTTCCTGC TATTTTTAGA AGTTGGAAAA GTAATAAATG 16 86 00 

CGCAAGCCGA CG AC AC T GAT GACGCTGAAA AACACGGGAG CTATCCGTTT CAATTTGATA 16366 0 

TAGATTTTCT GGAATCAGTT CATCGGGTAA GGATTCCAAA GAAAAAGGTT GATTTATATT 16 872 0 

GCCGTAGGCG ATGTAGGTTG TCATAAAATA AAAG CAC CAT TCAAAGGTGC TTTTATATTA 16 8 78 0 

AAAGGATTTA GTTTGTGTGT AAATTAGCGA ATCACTAATT CCGCCACAGA AACACCGACC 168 84 0 

AAGCAAGCAA CAACAACCCC AATTAAACCT GGCACCATAA AACTGTGGTT AAAATAATAT 16 8 900 

TTACCAATTT TTGTTGTACC CGTTACGTCA AAGTTTACTG TTGCAATATC TGACGGATAA 168 96 0 

TTTGGAATGA AGArATACGC ATAAGTTGCA GGCATTAAAC CAACTAAGAT TGGCGCAGGA 16 902 0 

ATACCTAAAC CAATACCAAC AGGTAATAAC ATTACGGCTG TTGCTGCTTG GCTATTAATT 16 9080 
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ACAACTGAGA CTGCAAATAA CGCCAATGCG AATGTCCAAG GGTAAGTTTG CACCATATCA 169140 
GTAATCGCGG CTTTAAATTC TGGCATTGCA TATTTGAAGT AAGTGTCGCT CATCCACGCA 169200 
ATACCATAAA TTG CAATCGC TGCTACCATA CCTGACTTAA ATACCACACC ATTTTG CACG 169260 
ATTTGAGGAT TCGTTTTCGT TGCAAGTAAA ATCACACCAC CAAAACAAAG CATCATCATT 169320 
TGAATGATAA GTGACATTGA AATTGCTTTT CCGCTACCGA TAGTGCGAAT TTCAGGAATC 169380 
ATCGCAATCG CAACAATCAC AACTAAAGCA AGTAAAAATA GGTAAACAGA TTTTTTTGCC 169440 
CCTTGCGGAA GGGTTTCATT CAATGTTGTA CTGGTAGTAT TTAAAATACG TTCGCGCCAT 169500 
ACTGGATCTT GTAAACGACG TTGATATTCA GGATCTTGTT CTAACTCTTT ACCACGACGC 16 9560 
ATACTGTATA AAGAAAGTGC AATCGTACCC GCTAAGGTTG CAGGAACGGT TACGCAAATA 16 9620 
ATATCTAATA AACTAATATG CTCAAAAGCG GGCATTGCTG TAATTTTACC TAGATAAAAT 169680 
ACAACAGCTG CAGAAAGTGG GCTTGATGTG ATGGCTAATT GCGATGCTAC CGAAGACGCT 16 974 0 
GCCATTGGAC GTTCTGGACG AATTTTGTTT TTCAATGCAA CATCGCCGAT GATTGGCATA 16 98 00 
ATTGTGTAAA CAGAGTGACC TGTACCAAGC ATAAAGGTTA ATAAATAGAC GACTAATGGG 16 9860 
CCGAGTAAAG TGACACGTTT TGGATTTTTA CGGAGAATGC GTTCAGCAAT TTGTAACATA 16 9920 
AATTTCAAAC CGCCAGCTGC TTCAAGTACG GATGCGCAAG TCACCACTGC AAGAATGATA 16 998 0 
AGCATAACAT CAATCGGTGC TTCCCCACTG GCATACGAAA TACGAAAACC TCGATAG CAA 170040 
GACCGATACC AGAAACAACA CCTAATCCGA TACCGCCGTA ACGGCTACCG ATATAGAGCA 17 0100 
TTAATAGTAA AAATAAAAAT TCTAAATAAA GCATAAACAC CTCAAAAATG GAAAACTATG 17016 0 
CAAATAATAC TTCTTTTTAG GCATATAACT AAATTATTTT TTAAAATTTA AACCAAGATT 17 022 0 
TTAAGTAAAC GATAATTTAG ATCAATTTTT ATGAAAAGGA AAAATAAAAA GTGCGGTAAT 170280 
TTTAAAGAGG ATTATTAGAA AAAAAATTAG CGACTTAAAA CGTCGCCCAA CTTAAAATTA 17034 0 
TTGATTGTTT TGCACGTAAT CAATCGCAGA TTGAACAGTG GTAATTTTTT CAGCTTCTTC 170400 
ATCAGGAATT TCAATATCAA ATTCTTCTTC TAAAGCCATT ACTAATTCAA CTGTGTCTAA 170460 
TGAATCTGCA CCTAAATCTT CAACAAAAGA AGCTTCAGGT TTAACATCTT CTTCTTTAAC 17052 0 
ACCTAATTGT TCAACGATGA TTTTTTTCAC GCGTTCTTCA ATACTCATTT GTTTTTC CTA 170580 
TTTGTTAGTT TTAACTCATA AAAGAGTGGT TAGTGTATGT ATTTTTTAGT AACTTGCAAC 17064 0 
TATTTTATGG TCTGGTCGCA CCACAAAATA GGCAATGTGA ACACACACAT CAAATATAGT 170700 
TACGTACCTT TGCATACGCT CAAGGCGTAA CCAAGTTACA TTGCGAGCGG GATTTTAGCA 17 0760 
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TTATCTAAAA TCTTTGGCTA TGATTTTATT GCTTAGAAAT CCCCCATAAA TCAATGTCAA 170820 

ATTTTAGCTT AAGTATAAGC CACCATTCAC GTGTAACGTT GTTCCCGTGA TATAACCTGC 170880 

ATCATCAGAA GCTAAGAAAG CAACGGCTTT AGCTATGTCT TTnGCTTCGC CTAAACGTCC 170940 

AGCTGGAACA TTAGATAAGA TCCCCGCTTT TkGTTCATCC GTAAGCACTT CTGTCATATC 171000 

TGTTGCAATA AAACCAGGAG CAACCACATT TACAGTAATA CCACGTGCAG CTACTTCTTT 171060 

CGCTAAAGAT TTAGAAAAAC CAACCACACC CGCTTTTGCC GCACAATAGT TAGTTTGTCC 171120 

TGGATTACCC GTTGAACCAA CCACTGAACC AATATTGATG ATACG AC C AA AACGTTTTTT 171180 

CATCATTGAA CGTAACATGG CTTTAGAAAG ATGATACACC GAAGTTAAGT TAGTTTGCAT 171240 

AATATCAAAC CATTCCTCAT CTTTCATACG CATCAATAAA TTATCGCGAG TAATACCTGC 1713 00 

GTTATTCACG AGAATATCAA TATCGCCAAA ATCATTTTTA ATTTGTTCAA GTAAGGTTTC 171360 

GATAGATTCT TTATCCGTTA CGTTTAAAAC TAAACCTTTA CCTTTATCCC CTAAATAAGC 1714 20 

CGAGATCGCC TCTGCACCCT TTTCAGAGGT TGCTGT AC CA ATTACAAATG CACCTTTTGA 1714 8 0 

ACTAAGTTCT TCTGCAATCG CACGACCGAT ACCACGGGTA GATCCTGTCA CTAAAGCAAT 17154 0 

TTTACCTTGC ATTTTTTCTT CCTTTTACTT ACACTAAAAA TTCTTCTACC GCATTGAATG 171600 

ATGCAACATC ATTTACAGAT ATTGCTTGTA AATCACCCAC AATGCGTTTG GTTAAACCAT 171660 

TTAATACTTT ACCTGGCCCC ACTTCAGCAA GCACTAGAAC GCCATCTTGC GCCATTTTTT 171720 

CAACTG TTTC AGTCCAACGA ACTGGACTAT ATAACTGACG CACAAGTGCG GTACGAATTT 17178 0 

CGGTGCCTTC AGTTTCAGCT TTCACATCAA CGTTATTTAA TACCGATATT GTTGGTGTAT 17184 0 

TAATTTGAAT ATTCTCAAGT GTTACCGCTA ATTGCTCGGC TGCAGGTTTC ATTAATGCAC 171900 

AGTGAGAAGG TACGCTCACA G CTAACGG CA ATGCACGTTT CGCCCCTGCT TCTTTACATA 17196 0 

ATGCAGCCGC ACGCTCAACT GCAGCTTTCG CACCCGCAAT AACTACTTGA CCCGGTGAGT 172 02 0 

TAAAGTTCAC CGCAGATACG ACTTCGCCTT CCTCTGCTTG TTTGCAAGCA TTAATAATTG 172 08 0 

CTTCATTATC TAAACCAATG ATTG CAT AC A TTGCGCCAGT GCCTTCAGGC AC AG CTTGTT 17214 0 

GCATTAATTT TCCGCGCAAT TC C ACT AATT T AATCG CATC TTGGAAATCC AACACGCCAG 172200 

CACAAACTAA CGCAGAATAC TCACCTAAGC TATGACCTGC CATCACTTCT GGTTTTAATT 17226 0 

GAGGAAATTT TTCTTTCCAT ACGCGATAAA TAGCGACTGA AGCAGCTAAA AGTGCGGGCT 17232 0 

GAGTTTGCCA AGTTTTATTA AGTTCTTCAG CTGGACCTTG TTGAACAAGA TACCATAAAT 172380 

CATAACCAAG CGCATCAGAT GCTTGTTTAA ATGTTTCAAT AACGATTGGA TATTCAGTTG 17244 0 
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CAAGATCAGC AAGCATACCG ACAGTTTGGG AGCCTTGACC TGGGAAGACC ATTGCGAATT 172500 
TTTTCATTAT GTTTTGTCCT ATTGTTTTTT AATTGAAAAT TCTTCTTCGG TAATTAAAAT 172560 
CGGGTTTCGC CGATGGCGAG CTTACTTTTT CTTTGCTTCG CCAAAGAAAA AGTAAGCAAA 172620 
AAGAAAGGCG CCCCTACTTT TCCTTGTTTC CTTTGTTCCA TTGAATTTGC CTTAACGGAA 172680 
ATTT t CTGAA CTCGCTACGC TCAAACAGAC GAAAATTTCC TACAAATTCA ATTCCACAAA 172740 
GGTGGAAAAG AAGGGAACCC AATTTGTTCT TTATCCGATT T AAAGTG CGG TCAAAACTCA 17 2800 
AAATATTTTT TGTTCTTAAA ATGAATTTAA AATTAACCGC ACTTTTTAAA TAATGCTCTG 172860 
ATAAATTCGG GGCCCCATAT AAATCACCTT TGCGACTTTC AATTTTCAGG CAATTTGCGT 172920 
CTGTTTGAGC GCAGCGAGTT CAGCAAATTG CCGTTAAGAA AATTGAAACA AAGCAAAGAA 172980 
CAGCGATTTA ACTGGGGCGT GTTTTCTTTG CCTACTTTCT TTTACACGAG TAAAAGAAAG 173 040 
TGAGTCGCCG TCAGGCGAAC CCCTGACATT TAACACAAAA AGTGCGGTCA ATTTTTACCG 173100 
CACTTTAAAA TCATCAATAA TTATATCAAA TCAACTAAAA TCTCACTAAC GCTGAACCCC 173160 
AAGTCCAACC ACCGCCAAAG GCTTCTAATA ATAGTAACTG CCCACGTTGA ATACGGCCAT 173220 
CTCGAATAGC CTCATCTAAA GCAACAGGAA CAGTTGCTGC ACTGTTATTA GCGTATTTAT 173280 
CTAACGTTAC CACCACTTGC GACATATCCA TTTCTAATTT TTTAGCTGTT GCTGTAATAA 173340 
TACGTAAATT TGCTTGGTGT GGCACAAGCC AGTCTAAATC TTTTTTATCT AAATTATTGG 17 3400 
CTAAAAGTGT TTCTTCCACC ACATTTGAAA GTTCACGAAC TGCCAATTTG AACGTTTCGT 173460 
TACCTTGCAT CTCGATATAG CCAGATTTTT CTATACCACG TTCTGGCTGA GCTAAAACAA 173 520 
GGGCATTATT TTTATCTGCT GAAGCGTGTA AATGGGTGGA GATAATTCCT TCTTGTTCAC 173 58 0 
TCGCTTCTAA AATTACAGCA CCCGCACCAT CACCAAATAG CACAACAGTG CTGCGATCTG 173640 
TTTCATCTAA TTTACGAGAA TTGAGATCTG AGCCTATCAC TAAGGCTTTT TTCACTTTGC 173 700 
CTGCACGAAT AAATTGATCA GCTACGCTCA AAGCATAGAC AAAGCCTGTG CAAGCTGCCG 173 760 
C CAAATC AAA AGaAATCGCA TCATCAATAT TTAATAAACC TTGCACTTGG CAAGCCGcAC 173 82 0 
TTGGATAAGC ATGTGAGTGA CTTGTAGTTG CAACAATAAT CAGTTCAATA TCTTGAGGAT 173 880 
TAATTTGAGC AGCTTCGATC GCATTTTTTG CCGCTTCAAA TCCCATTGTT GCAACAGTTT 173 940 
CATCTTCCGC TGCGATACGA CGTTCACGGA T AC CAGAACG AGTGACAATC CATTCATCTG 174 000 
ATGTATCAAC C ATTTTTT CT AAATCCGCAT TTGTGCGAAT ATGGCTCGGC AGATAGCTAC 174 060 
CGGTGGATAA AATTCTACTA TTCATACTCA ATAAGATAAG CGATTAAAAA GGAAAAAATC 17412 0 
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TATGGACTAT CGCCAAGTAA GAGATAAGAC CACACCGGAT TATTTTC CG A TGTTAAAGGA 174180 

ATGCAGATAG TAATACGGGG ACCAACGTCC CCGCCCATCA CATCTAACGC CAAGGTTAGA 17424 0 

CGGTTCAAGT GAACACCTAT AAGTAAAAAA TTACTTATTG ATCACTTTAC GACCACGATA 174300 

GTAACCGTCT GCAGTTACGT GGTGGCGTAA GTGAGTTTCA CCGCTTGCTT TATCTACTGA 174360 

TACAGCAGCA GTTGTTAAAG CATCGTGTGA ACGACGCATA TCACGACGTG AACGAGATTT 174420 

TTTGTTTTGT TGAACAGCCA TTGGCTATAC TCCTAAATTT AATTTACTTT TGCTTTAAAT 174480 

TAG CT AAT AC AGCGAACGGG TTCGGTTTTT TTGCCAATTC TTCAGGCAAT ACGCCAAAAA 174 54 0 

CCTGTTCCTG CGCGG AC AC T TCACAGTGTT CAGATGAATG CATTGGTACA AGCGG CAAAG 174600 

CTAAAATTAG CTCATCTTCC ACCGTGCCAA TTAAATCTAT TTCGCCAAAC TCATTGAACT 17 1660 

CGATTGGCTC ATAAATTTCG GGCAAGTCAT CAGCCTGATC CCAATTAGCC ACTGGACTAT 174720 

AAGTAAACTC ACATTCCAAT GTTTGTTTGT AAGGTTCGTT ACAACGCTGA CATTCAAGAT 174780 

CGACATCAAT CTGTACTTTA CCTTTCATCA CCACTAATTT TTGTGGATCA ACATAAAACG 17484 0 

ATAATGTTAC CTGTGCATCG CTGAGCACGT TTACCACTGA TTCCGCCAAA CGCTCAAGCT 174900 

GATTTGCCGC ATAATAACCA ACATAATCAA GCCGTCTTTG AGCATCTTTT ATGGGATCAA 174960 

CAGTGAGGGG TAGTTTTACC TTTTGCATAG GGCGTGAATA TTACCTGTTG TTGCGAAGAT 175020 

AGTCAAAGGG AAATCACATT TTTTATTGGA TAACTCAATA AAGGAAATTA TCCGCTAACT 175080 

AAAAAGGTTG GAACTATAAC AAAAATTGCT ATAATCACGC CTCTTTTTTT AACAAGGAAA 17 514 0 

GATAATGATG ACTTCAAATT CTTATTGGCA ACGCTTAAAA GTGG CATTTC AATATGTCAT 175200 

GCCACAAATT TATTTAACTC AAATTG CTGG CTGGTTTGCT AAACAAAAAT GGGGCAAAAT 17 526 0 

AACACATTTT GTAATTAAAG CTTTTGCGAA AAAATACAAC ATTGATATGA GCATTGCGCA 17 5320 

AAAAGAACAA TTTTCTGATT ACGCAAGTTT TAATGAATTT TTTATTCGTC CGTTAAAAGA 17 5380 

AAACGCACGT CCAATTAATC AAAATCCAAC CGCACTTTGT TTACCAGCAG ACGGTCGCAT 17 5440 

TAGTGAGTGC GGTCATATTG ACGATAATCT TTTATTGCAA GCTAAGGGGC ATTTTTTCAG 17 5 500 

C CTAGAAG AC TTATTGGCAG AAGATAAAGA ATTAGTGGAA ACCTTTAAAA ATGGGGAATT 17 5 560 

TGTAACTACT TATCTTTCTC CTCGTGATTA TCACCGAGTG CATATGCCAT GCGATGCTAC 17 5620 

GCTACGCAAA ATGATTTATG TGCCGGGTGA TTTATTCTCT GTGAACCCAT TTTTAGCCCA 17 5680 

ACATGTACCA AATTTATTTG CACGTAATGA ACGTGTGATT TGTGTATTTG ATACTGAATT 175740 

TGGCACAATG GTACAAATTT TAGTGGGTGC AACCATCACT GCAAGTATTG GCACAACTTG 175800 
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GGCAGGCGTA ATTAATCCTC CACGCCACAA ' CGAAGTGAAA ACTTGGACTT ATGAAGGCGA 17586 0 

AAGTGCGGTC AAATTATTGA AAGGTCAAGA AATGGGGTGG TTCCAACTTG GTTCGACAGT 17 5 92 0 

AATTAATTTA TTCCAAGCAA ATCAAGTGCG TTTAGCTGAT CATTTAAGCG TTAATGAACC 17598 0 

TGTTCGCATG GGCGAAATCT TGGCATATAA AAAATAAACA GGATTTATTT CCATAAATGA 176 04 0 

CAGAAACAAA TCCTGAAAGT AGATATTTCT CACTTCTTTA AAGGCAGTAT AACCTTACTG 17610 0 

C CTTTTT CTT CTTTCATTTG AACAATTAAC GCATTGTTAC AAATTCCTCT GAACCTGTTG 17616 0 

GATGAATTGC CACCGTATTG TCAAAATCAG CTTTTGTTGC ACCCATTTTG ATTGCTACAG 176220 

CAAATCCTTG AATCATTTCA TCTACACCAA AACCAATACC ATGTAAACCC ACAACTTTTT 1762 80 

CATCTTTACC CACACAAACT AATTTCATTT TGCACGGTTG GCGATGTTGA GTTACCGCAG 176 34 0 

TGTACATCGC TGTGAAAGAA GATTTATATA C CTT AAC ATT TTCTGCGCCG TACTGCTCAA 1764 00 

TCGCTTGCGG TTCAGTTAAA CCTACAGTGC CGATAGGCGG ATGGCTAAAT ACAACGGTTG 17646 0 

GAACTAAACT GTAATCTAAA T A t TC AGTCG GTTTATTATT AAATAAACGC TCAGAAAGGC 176 520 

GACGACCTGC TGCAACTGCA ACTGGTGTTA ATTCAATGCC GTTTTCGATA ATATCGCCTA 176 580 

CCGCATAAAT GCCTTTCACA TTAGTATTTT GATATTTATC TACTTTGACA TAG CC ATGTT 176 64 0 

CGTTCGTTTC TACGCCAGCA TTTTCTAAGC CAATTTTATC TGTCGTTGGA ACACGACCCG 176 7 00 

CAGCCCAAAT AACGCAATCT ACGGTAACAT CAGATTGACC AT CACATCTT ACAGTAAGTG 176 760 

AACCATCTGC ATTTTTTACA ATTTCAGATG GGGTAGAATT GGTATGTAAT TGAATTCCAT 176 82 0 

CTTGCGCAAG CACTTCCACT AATGTTTCTA CGATTAATGG ATCCTGATTA CGCATTGGCG 1768 8 0 

CATGGCGACG CACTAATAAA TGTGTTTCCA CGCCTAAGCT ATTTAATACG CCAGAAAGTT 176 94 0 

CAACAGCAAT ATAGCCTGCA CCAATAACAG CAGCACGTTT TGGTAATTCG GTTAATGCAA 177 000 

AG AAACC AT C TGAATCAATG CCGTATTCTT GTCCTTTAAT ATTTGGACGA TATGGACGAC 17 706 0 

CACCAGTTGC GATTAAAATA TGATCTGCGG TTACTTGTTC TTTTGTACCA TCAGCAAGTG 177120 

TTACTTCAAT GGTATGGGCA TCGACAAATT TTCCGAAACC ATTAATTACA TCAATATTAT 17 7180 

TTTTCGCTAA TACATTATTA TAAGATGTAT GGATACGACT AATATAGGCT TGGCGACTTT 17 7240 

CAATCAGTTT TGAAAAATCA AATTTTTTCA CTTCAACATC AAAACCATAA TCTGGCGCAT 17 73 00 

AATTGTTGAT TGCTTCTGCA ATATGCGCAC CATAAAACAT CACTTTTTTA GGTACACAAC 177 360 

CGACATTTAC ACAAGTTCCG C CAAG ATGTT TTGCtTCAAT GATTGCACAT TTTTTtCCAT 177420 

AGCTTGCTGC T CGATTTAG A GACGCAATAC CGCCACTGCC ACCGCCAATA GCAATATAAT 1774 8 0 
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CATAATGTTT AGTCATAGTT TGAACCTTTT TTAAACTAAA ATTTTTTTGT ATTCTGCCTA 177 54 0 

ATTTTTATAA AGAATGCTAT AAATTGATGT AATTGGTTTT ATGAATATTT TTTACTCTTT 17 76 00 

TGAACAGATG ATTAGTTTGA AATGATTTTA TGTGAAGCCT AGTATGTAGA ATGAAATAGC 17 766 0 

GTGATAGAAA AAGAGGAAGA TTGTTCAATC TTCCTCACTG TAGATATGTG ATTAATTAGA 17 7720 

ATATTGGTTT ATCGCAGCCG CAACTTCTTG ATCTTGATAA ATATTGTTCA CAATATCATT 17 7780 

AAAAGTTTGA GATAGCACTT TATGAATTTC ATCATTGCCC GCATTAAATA CGCCCTCTTG 17784 0 

TGAGTGCGTG ACATTAAATG ATTTATTATA CGAACCTTTC GCACCTTGTA CATAAACTGT 177 900 

CGCTTGAATT TTGGTATTAA GTTTATAACG AAGATTACCT TGTTCTACTT GCGTACCAAA 177 96 0 

TTCACGCACA TCCACAGTTA CCCACGCATT TGAACCATTT AATTGCCCAA CTCTAAAACC 17802 0 

TTTACTAATT AAATTTTGCT G CATC AC TTG CTGAAATAAT TGTGTAACAC TTGGGGATGC 17808 0 

ATTTAATTTA ATCAGTTCCC CGTGTTTCGT ATAACTCGCA ATCTCTTGTA TTGCACGGCT 17814 0 

ATCTTTTGTT G T C AC AG AT A CAACGGCAGT TCGATTAACA TTCATTGAAG CACTTGGTGC 17 82 00 

AGGCGGTGTA AATG CTAAT A TATTTGATTG TGCTTGGCAG CCTGCAAGAA ATAATGT AG C 178260 

CGCTAGAATA CCTACTGATG ATAATGTTTT AATTTTGTTT ACTAATGCCA TAATTTCCCC 178320 

ATAAATCAAT TAGAATTGTG CTATTCTTAC AAACTTCCTT ATTAATGTAT AGCGTTTCAT 178 3 80 

TTTAGTTTTT GTGGAGAACT TATAATGTCA ATTCAACAAA TTATTGAGCA AAAAATTCAA 17844 0 

AAAGAATTTC AACCGCACTT TTTAGCAATA GAAAATGAAA GCCATTTACA TCACTCTAAT 178 50 0 

CGAGGTTCTG AATCACATTT TAAATGCGTT ATTGTCAGCG CAGACTTTAA AAATATTCGA 17 8 56 0 

AAAGTACAAC GTCATCAGCG TATTTATCAA TTATTGAACG AAGAATTAAA TCATAGTATT 17862 0 

CACGCATTAG CATTACATTT ATTCACTCCA GAAGAATGGA AAGCACAAAA TGAAACTGTA 1786 8 0 

CCGCACTCAA CAAAATGTGC AGGTATTGGT CGTTAATTTT GTTATAAAAA TGTAACATTT 17874 0 

TTGATATATC G CAAAAAGTT CTAACAAAAA ACCCTCTGTT TAAGTAGCGA AAAACAGAGA 178 80 0 

TTTTCGCTAG AATGAGGCGT TTAATTTAGT CCAAATTCGA TTTCTCGGTA AAATTATTAA 17 886 0 

CTGTGAAATC AATCCATTTG AGTTTTTTAT TTTGTGACAT TAATGTTTTT TTAGCGGTTT 178 92 0 

GTCACACTCA AACGTGAGAT TGCTTCTAGG GTATTTTTAC TAT ATT AG C A ATTGTATTTT 178 98 0 

TAACCTTGAA AAGTAGTGCA AATAGTATGA TTACAATTAA GAAAGGTTTG GATCTCCCAA 17904 0 

TTGCGGGAAA ACCAGCACAA GTAATCCATA GCGGCAACGC TGTGAATCAA GTTGCGATTC 17 9100 

TAGGTGAGGA GTATGTGGGG ATGCGTCCTT CAATGAAGGT GCGCGAAGGC GATGTTGTGA 17 916 0 
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AAAAAGGTCA AGTACTTTTT GAAGACAAAA AAAATCCTGG TGTAATTTTT ACAGCCCCTG 179220 
CAAGCGGTAC CATCACTGCA ATCAATCGTG G CG AAAAACG TGTATTACAA TCTGTTGTTA 179280 
TTAATGTAGA AGGTGATGAA AAAATCACTT TTGCAAAATA TAGCACAGaA CAATTGAATA 179340 
CACTTTCTTC TGAACAAGTA AAACAAAACT TGATAGAATC TGGCTTATGG ACGGCATTAC 179400 
GTACTCGTCC ATTTAGTAAA GTTCCATCCA TTGAAAGTGA AGCATCTTCT ATTTTTGTAA 179460 
ATGCGATGGA TACCAATCCA TTAGCAGCCG ATCCCTCTGT CGTATTAAAA GAATATTCG C 17*520 
AAGACTTCAC TAACGGTTTA ACCGTATTAA GTCGTCTATT CCCTTCAAAA CCATTACACT 179580 
TATGTAAAGC AGGCGATTCT AATATTCCAA CTGCTGATCT TGAAAATTTA CAGATTCATG 179640 
ATTTTACTGG TGTTCATCCA GCTGGTCTTG TAGGGACGCA TATTCACTTT ATTGATCCTG 179700 
TCGGTATTCA AAAAACTGTA TGGCATATCA ATTATCAAGA TGTGATCGCT GTAGGTAAAT 17976 0 
TATTTACAAC AGGCGAACTT TATTCAGAAC GAGTAATTTC TCTTGCAGGT CCTCAAGTGA 17 9820 
AAGAACCTCG TTTAGTACGT ACAACAATCG GTGCTAATCT TTCTCAATTA ACCCAAAATG 179880 
AATTAAGTGC AGGTAAAAAT CGTGTAATTT CTGGCTCAGT TCTTTGTGGT CAAATAGCAA 179940 
AAGATTCACA TGATTATTTA GGTCGCTATG CTTTGCAAGT ATCAGTAATC GCTGAAGGTA 18 0000 
ATGAAAAAGA ATTTTTTGGT TGGATTATG C CACAAGCCAA TAAATATTCT GTGACTCGTA 180060 
CCGTATTAGG GCCACTTCAG TAAAAAATTG TTTAATTTTA CTACTTCAGA AAATGGTGGC 180120 
GAACGTGCAA TGGTG CCAAT CGGTAGCTAT GAGCGCGTAA TGCCATTGGA CATTTTACCT 180180 
ACATTATTAT TACGTGATTT AATCGTGGGC GATACCGATG GTGCTCAAGA GTTAGGTTGT 180240 
CTGGAATTAG ACGAAGAAGA CTTGGCGTTA TGTTCTTTCG TTTGTCCAGG CAAATATGAA 180300 
TACGGTTCAA TCTTGCGTCA AGTCTTAGAT AAGATTGAGA AGGAAGGTTA AAAATGGGTT 18 036 0 
TGAAAAATCT TTTTGAAAAA ATGGAACCCG CGTTTTTACC CGGTGGTAAA TACAGCAAGC 18 0420 
TTTATCCGAT CTTTGAATCG ATTTATACCT TGCTTTATAC ACCAGGTACG GTAACGCACA 18 0480 
AAAACACTCA CGTTCGTGAT GCGTTAGATT CAAAACGTAT GATGATTACG GTTTTCCTTG 180540 
CGTTGTTCCC TGCGATTTTC TATGGGATGT ACAATGTGGG TAACCAAGCG ATTCCAGCGT 180600 
TAAATCAATT AGGCAATTTA GATCAACTAA TTGCTAACGA TTGGCACTAT GCTCTTGCAA 180660 
GTTCATTAGG TTTAGATTTA ACTGC CAATG CAACTTGGGG CTCTAAAATG GCTCTTGGAG 18 072 0 
CAATTTTcTT TTTACCAATT TACTTAGTGG TATTTACCGT TTGTACTATT TGGGAATTGT 180780 
TATTCTCTGT GGTTCGTGGT CACGAAGTAA ATGAaGGAAT GTtCGTTTCA ACCATTTTAT 180840 
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TTGCCTTGAT CGTTCCACCA ACATTGCCAC TATGGCAAGC CGCATTAGGT ATTACTTTTG 180900 
GTATCATTGT TGCAAAAGAA ATTTTTGGTG GCGTAGGTCG TAACTTTATG AACCCTGCAC 180960 
TTGCTGGTCG TGCGTTCTTA TTCTTCGCAT ATCCAGCTCA AATTTCAGGC GATACGGTTT 181020 
GGACTGCCGC AGATGGTTTC TCTGGTGCAA CCGCACTTTC ACAGTGGTCA CAAGGTGGTC 181080 
AAGGTGCATT GCAACATACC GTAACAGGGG CTCCAATTAC TTGGATGGAT GCTTTTGTTG 181140 
GTAACTTACC TGGTTCAATG GGGGAAGTTT CTACCCTTGC AATTTTAATT GGCGGTGCTG 181200 
TTATTGTATT CACTCGCATT GCAGCTTGGC GCATTATTGC TGGTGTGATG ATTGGCATGA 181260 
TTGCAACCTC AACCTTGTTT AACCTAATTG GCTCAGAGAC TAACCCTATG TTCTCAATGC 181320 
CTTGGCATTG GCACTTTGTT TTAGGTGGAT TTGCATTGGG TATGGTATTT ATGGCGACCG 181380 
ATCCTGTTTC AGCATCTTTT ACTAATACTG GTAAATGGTG GTACGGTGCA TTAATTGGCG 181440 
TAATGGCTGT ATTAATTCGT ACTGTAAACC CAGCCTATCC AGAAGGAATG ATGTTAGCGA 181500 
TTTTATTTGC AAACTTATTT GCACCGATTT TCGACTACAT CGTTGTTCAA GCAAATATCA 181560 
AACGTCGGAG AGCAAGAACA AATGGCTAAG TTTAATAAAG ACAGCGTAGG CGGCACAATC 181620 
CTTGTTGTGC TACTACTGAG TTTAGTTTGT TCCATTATTG TTGCTGGTTC CGCAGTAATG 181680 
TTAAAGCCTG CACAAGAAGA ACAAAAGTTA CTCGATAAAC AAAAAAATAT CTTAAATGTA 18174 0 
G CTGG TCTTT TACAAGAAAA CACGAATGTA AAAGAAACTT ATGCAAAATT TATCGAGCCC 181800 
CGTTTCGTTG ATTTAGCAAC GGGTGAATAC ACACAACAAG CAGATGATAG CCAACAAGCA 181860 
ATTCCTGCTG ATGCGGATAA AGCACGTATT CGTTCTCGTA GCAAAACAAC TGAAGTTTAT 18192 0 
CTTGTAAAAG ATGAACAGGG TCAAACTCAA CAAGTTATTT TACCAATTTA TGGCACTGGC 18198 0 
TTGTGGTCAG TGATGTATGG CTTAGTATCT GTTCAACCAG ATGGTAATAC TATTAATGGA 182040 
ATCACTTACT ATCAACACGG AGAAACACCG GGATTGGGCG GAGAAATTGA AAATCCAAAC 182100 
TGGGCAAGTC TATTTAAAGG TAAAAAATTA TTTGATGAGC AACATCAACC TGCCATTCGT 18216 0 
ATTGTGAAAG GTCAAGCTCC ACAAGATGAA CAT AG CATTG ATGGTTTATC TGGTGCAACT 182220 
TTAACAGGTA ATGGTGTTCA AGGTACCTTT AATTATTGGT TCAGCAAAGA TGG CTTTGGT 18228 0 
CCATATCTTG AAAAACTTCA TTCAGGAGCA AACTAATGTC TGGAAAAACA AGTTATAAAG 182 34 0 
ATCTGTTGTT AGCGCCAATT GCTAAAAACA ACCCTATTGC ATTGCAAATC TTAGGGATTT 182400 
GTTCTGCATT AG CTGT AACA ACAAAATTAG AAACTGCTTT TGTTATGGCG ATTGCAGTTA 18246 0 
CCTTAGTAAC TGGATTATCA AACTTATTTG TTTCCCTAAT CCGCAATTAT ATCCCTAACA 182 52 0 
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GCATTCGTAT TATTGTACAG CTCGCAATCA TCGCTCGCTT GTtATCGTAG TGGATCAAAT 182 58 0 

TCTAAAAGCC TATGCTTATG GTTTATCTAA ACAACTTTCA GTATTCGTTG GTTTGATTAT 18 264 0 

TACTAACTGT ATCGTAATGG GGCGTGCTGA AGCATTTGCG ATGAAATCCC CTC CAGTTG A 182 700 

AAGTTTTGTA GATGGTATCG GAAATGGCTT AGGTTATGGT TCAATGTTAA TTATCGTAGC 18 276 0 

ATTTTTCCGT GAACTTATCG GTTCAGGTAA ATTATTTGGT ATGACGATTT TCGAAACTAT 182820 

TCAAAATGGT GGCTGGTATC AAGCAAATGG TTTATTCCTA CTTGCACCAA GTGCATTCTT 1828 80 

TATCATCGGA TTTGTGATCT GGGGATTAAG AACTTGGAAA CCAGAACAAC AGGAGAAGTA 182 94 0 

ATCAATGGAA CATTATATTA GCCTATTTGT TAAGGCAGTC TTCATTGAAA AT ATGG C ACT 183000 

TTCTTTCTTC TTAGGAATGT GTACTTTCTT AGCGGTATCT AAAAAGGTGT CCCCTGCGTT 183 060 

TGGTCTTGGG ATTGCGGTAA CTTTCGTTCT TGGGATCGCA GTGC CAGTAA AC C AATTAAT 183120 

TTATGCAAAT GTACTAAAAG AGAATGCTTT AATTGAAGGT GTAGATTTAT CATTCTTAAA 183180 

CTTCATCACT TTCATTGGGG TTATTGCTGG TTTAGTGCAA ATTCTTGAAA TGGTATTAGA 18 3 24 0 

TAAATTTATG CCATCTCTTT ATAACG CATT AGGGATTtCT TACCTTTAAT CGCAGTAAAC 18 3 3 00 

TGTGCGATCT TTGGTGGGGT ATCTTTCATG GTTCAACGTG ATTACAATTT C C CTGAATCT 18 336 0 

ATCGTGTATG GTTTCGGCTC AGGTTTAGGT TGGATGTTAG CGATTGTAGC GCTTGCTGGC 18 3420 

TTAACGGAAA AAATGAAATA CGCTGATATT CCTGCTGGAT TAAAAGGTTT AGGTATTACC 18 3480 

TTT AT CTCTG TTGGTTTAAT GGCACTAGGC TTTATGTCTT TCTCTGGTAT TCAATTATAA 183 540 

GGAGTGTAAT CAATGAGCGA TTCAGTAATT CTTGCACTCG GTATTGCCGC ATTCACGGTT 18 36 00 

ATTGTATTAG TGTTAGTGGC AATCATCTTA TTTGCGAAAT CAAAATTAGT CGATTCTGGT 183660 

GATATTACTA TCGACATCAA TGACGATCCT GAAAAAGCGA TCACATTACC TGCAGGTGGC 18 3 720 

AAATTATTAG GGG CTTTAGC AAGTAAAGGT ATTTtCGTGT CTTCTGCTTG CGGCGGCGGT 18 3 78 0 

GGCTCTTGTG GTCAATGTAT TGTTAAAGTG AAAAATGGTG GCGGAGAAAT TCTTCCAACC 183840 

GAACTTTCTC ACATTAACAA ACGTGAAGCA AAAGAGGGAT ATCGCTTAGC TTGTCAAGTT 18 3 900 

AATGTAAAAG GTAATATGGA AGTTGAACTT CCAGAAGAAA TCTTCGGCGT GAAAAAATGG 18 3 96 0 

GAATGTACTG TTATTTCTAA CGATAACAAA GCAACCTTTA TCAAAGAGCT TAAATTGGCT 184 02 0 

ATTCCTGAAG GCGAGGAAGT GCCTTTCCGT GCAGGTGGTT ATATC CAAAT CGAAG CTGAA 184 080 

CCACATGTGG TGAACTATAA AGATTTCGAT ATTCCAGAAG AATAC CACG A AGACTGGGAT 18414 0 

AAATATGATT TATGGCGTTA TGTCTCTAAA GTGGACGAGC ATATTATCCG TGCTTACTCA 1842 0 0 
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ATGGCTTCAT ACCCTGAAGA GAAAGGTATC ATTATGCTTA ACGTTCGTAT TGCGACTCCA 1842 60 

CCACCACGCC AACCTGATGC GCCTCCGGGT CAAATGTCTT CTTACATTTG GTCATTAAAA 184320 

G C AGG AG ATA AAGTCACAAT TTCTGGTCCA TTTGGTGAAT TCTTTGCGAA AGAAACTGAT 184380 

GCAGAAATGG TATTTATCGG TGGTGGTGCG GGTATGGCTC CAATGCGTTC ACATATCTTC 1844 4 0 

GACCAATTAA AACGTTTACA TTCTAAACGT AAAATGTCGT TCTGGTATGG TGCACGTTCT 184 5 00 

AAACG TG AAA TATTCTATCA AGAAGACTTT GACCAATTAC AAGCTGAAAA TCCAAACTTT 184 56 0 

GTATGG CACG TGGCACTTTC AGATG CATTG C CTG AAGAT A ATTGGACTGG CTATACAGGC 18462 0 

t TTATTCACA ATGTACTTTA TGAAAACTAC TTGAAAAATC ATGAAGCACC AGAAGACTGT 184680 

G AAT AC TATA TGTGTGGGCC TCCAGTGATG AATGCAGCTG TAATTAAAAT GTTGAAAGAT 184 74 0 

CTTGGTGTTG AAGATGAAAA CATTTTATTA GATGACTTTG GTGGCTGATT TTAGTTAATC 184 8 00 

AATTCATCAT AAAAGCTGcC CGCACTTTGT GATAAAGTGC GGnC s AAATT TAAGATGTTT 18486 0 

TTAAACATTT TCTATGATTA CAGGTAAGTT TTGTTGAGTT TGCCAGTAAT TCACCAATTT 184920 

AGGAAATCCA GATGAAAAAA TTAATAAGCG GTATCATAGC GGTAGCAATG GCATTAAGTC 184 98 0 

TTGCTGCTTG TCAAAAAGAA ACAAAAGTTA TCTCTTTAAG CGGTAAAACA ATGGGGACAA 18 504 0 

CTTATCATGT TAAATACCTT GATGATGGTT C AAT AAC AG C AACATCTGAA AAGACGCATG 185100 

AAGAAATTGA AGCAATCTTA AAAGATGTAA ACGCGAAAAT GTCCACTTAC AAAAAAGATT 18516 0 

CGGAATTGAG CCGCTTCAAT CAAAATACCC AAGTGAACAC ACCGATTGAG ATTTCAGCAG 185220 

ATTTTGCCAA AGTATTAGCT GAAGCGATTC GTTTAAATAA AGTGACCGAA GGCGCGTTGG 18 52 8 0 

ATGTAACTGT TGGCCCTGTC GTGAATTTAT GGGGATTTGG TCCTGAAAAA CGCCCCGAAA 18 534 0 

AGCAAC CTAC ACCAGAACAA TTAGCTGAAC GCCAAGCGTG GGTTGGCATT GATAAAATTA 18 5400 

CCCTAGATAC TAACAAAGAA AAAGCTACAT TAAGTAAAGC ACTTCCTCAA GTTTACG TAG 18 5460 

ATTTATCGTC AATTG CT AAA GGCTTTGGCG TTGATCAGGT GGCTGAAAAG TTAGAACAAT 18 5 52 0 

TAAATGCTCA GAATTACATG GTTGAAATCG GCGGTGAAAT TCGTGCGAAA GGAAAAAATA 185580 

TTGAAGGTAA ACCTTGGCAG ATTGCCATTG AAAAACCAAC T A C AAC AGG C GAAAGAGCGG 18 5640 

TTGAAGCGGT CATTGGATTA AACAATATGG GAATGG C AAG TTCTGGCGAT TACCGTATTT 18 5700 

ACTTTGAAGA AAATGG C AAA CGCTTTGCGC ACGAAATTGA TCCGAAAACA GGTTATCCAA 18 5 76 0 

TTCAGCATCA TTTAGCCTCA ATTACGGTAC TTGCACCAAC CTCAATGACT GCAGATGGCT 18 582 0 

TATCTACAGG GTTATTTGTG CTGGGGGAAG ACAAGGCGTT AGAAGTGGCT GAGAAAAATA 18 5880 

-77.112- 

SUBSTUUTE SHEET (RULE 26) 



BMSDOCIO <WO 9S33276A1 I > 



WO 96/33276 PCI7US96/05320 

ATCTTGCCGT TTATTTAATC ATTAGAACAG ATAATGGTTT TGTTACCAAA TCATCCTCTG 18 594 0 

CTTTCAAAAA ATTAACAGAA ACAAAAGAAT AGTTATGCAA ACTTTATTTT TTACTTTAAT 186000 

CGCTTTCGTC GCAATTATAT TGTTGATGTC TATCGGATTT ATTATCAAAA AACAAAGTTT 18606 0 

AAAAGG CAGC TGCGGTGGTT TATCTACCCT TGGTATTGCC AAAGCTTGCG ATTGTGATAA 18612 0 

ACCTTGCGAC ACGCnTCAaT CAAAATTAGA TGCAGGCGAT G r ACAAGCG A AAGCTGAATA 185180 

TGAGCAAAAA TTTGCGAAAA AAGATGATGA CTCGCAATTT TATCAAGTGA AATAACTTGC 18624 0 

TCAAAAATGA CCGCACTTTA TGTGCAAAAT AAAATTAACG GCAAAATTAT TTTGCCGTTT 186300 

TCTCATTCAA TGACCAATGT GCTTCGCCTT AACCGAAGTT ACGGGAGAAC TATGTTAATT 186360 

TCAAATACTT ATAATCAACA CTTTCCTCAA TTGACGCAAG AGCAGCTTGC GAGAAATGCG 186420 

ACAAAAAAAG TCATTTGTGG TATGTCTGGT GGCGTGGATT CTTCTGTGTC AGCGTTTATT 186480 

CTTCAACAGC AAGGCTATCA GGTGGAAGGC CTGTTTATGr AAAACTGGGA AGAAGATGAT 186540 

GATACGGATT ATTGTACTGC CGCAGCTGAT CTTGCAGATG CTCAGGCTGT ATGTGATAAG 18 6600 

TTGGGGATCA AACTACATAA AATTAATTTT GCGGCAGAAT ATTGGGATAA TGTCTTTGAG 186660 

CATTTTTTAA CCGAATATAA AGCAGGGCGC ACGCCGAACC CAGATATTTT GTGTAATAAA 18672 0 

GAAATTAAAT TTAAAGCATT TTTAGAATAT GCAGCTGAAG ATCTCGGTGC AGATTATATT 186 780 

GCAACAGGGC ATTATGTACG TAGAGCGGGA GATAATGAAA ATGCAAAACT ATTACGTGGT 18 6840 

TTAGATCCCA ATAAAGATCA AAGTTATTTT CTTTATACTT TAAGCCATAA ACAAGTGGGG 186900 

CAAAGTTTAT TCC CCGTTGG TGAAATCGAG AAGCCCATTG TTCGTGCTAT TGCTGAAGAT 186 960 

CTTGGCTTAA TTACGGCGAA GAAAAAAGAT TCTACAGGGA TTTGTTTTAT TGGTGAGCGT 187 020 

AAATTTAAGG ATTTCTTAGC ACGCTATTTA CCAGCTCAAC CAGGTAATAT TCG CACTGT A 187080 

GATGATGAAA TTATTGGTCG CCATGATGGA TTAATGTATC ACACATTGGG ACAACGCAAA 187140 

GGATTAGGCA TTGGTGGTCT AAAAAATGCG GGAGATGAAG CTTGGTATGT AGTAGATAAA 187200 

GATGTAGAAA ATAATGAACT TATTGTCGCA CAAGGTCACG ATCATCCTCG TTTATTTTCA 18 7260 

AAAGGATTGA TTGCCAGCCA ATTACATTGG GTTGATCGCG AACCAATTCG AGAGTCATTA 18 7320 

CGTTGCACGG TGAAAACGCG TTATCGCCAA CAAGATATTC CTTGTGTGAT TGAACCAATT 187380 

GATGATGAAA CCATTCGAGT TATTTTTGAT GAACCTCAAT CAGCAGTAAC CCCAGGGCAA 18744 0 

TCTGCCGTAT TTT AC CTTGG CGAAGTTTGT TTGGGTGGCG GTATTATCGC AGAAAGAATA 18 7 500 

TAAAACAAAG ATGACTAAAA GCGCGTAAAA TGCGCTTTTA AACTATTTAT TCAAACCAAA 187 560 
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TGACGCTCGC CATTCGTCCC GTTTGATTGT CCCTGCGATA AGAAAAGAAC TTTTCTTTTT 18 7620 
CGTTGAATGT ACAGTGATTT CCACCATAAA TTTGCGTAAT ACCTAGATTG TTTAATCGCT 187680 
GAGTTGCAAT TTGATAAAGA TTACTCAGGT ATTTACCTTC TTCGATTGCA TCAGGTTGAA 187740 
AGGCGAGTTT GGCTTTTTCA TCTACTACAA CAAACTTTTC TACAACATCA ATCCCAACTT 187800 
GAAAGGCCTT TGGACCTATT GCAGGGCCAA ACCACGCGAT AATATCTTCA GGTTTAGCTT 18 78 SO 
GAAAATATTT CACTGTTTCT TCTAGTACAC CATCGCATAA ACCACGCCAG CCAGCATGTG 187920 
TTGCAGCCAC TTCATTGCCA GATGTTGTAG TGAATAGAAC TGGCAAACAG TCTGCCGTCA 18 7 980 
TAACAACGCA AACTTGATTG GGAACATTTG TATAAACTGC ATCCGCTTCA AGATTTTGTC 188040 
CTGAATAAGG TAATTGAATC ACTCGAGTAC TGTGTGTTTG AGTTAGAAAT ATAGGTGTTT 18 8100 
GTGGCAAACC AAATTTTTCT ACTAATAAAG TGCGGTTGGT TTTTACCGCA CTTTTGTTAT 18 816 0 
CACCGACATG ATCGCCTAAG TTGAAACTCA AATAAGGCGC TAAGCTCACA CCCCCTTCAC 18822 0 
GAGTAGTGGT AAAGGCATGA ATATTCTTTG GAACATTCCA ATTGGGGTTA ATTGCTTGCA 18828 0 
TATTAATAAT CTAACTCATC TTGATGTTCG AGATAGTCGG CTTTGAGGGC ATTAAGCAAT 18834 0 
TCAACGAAAT CATCTGGTAA TGGTGCATAC CATTCCATCA TCTCCCCTGT GATTGGATGA 188400 
GCTAAGCGTA ACATGACCGC ATGTAAAGCT TGGCGTTTGA AATTACGCAA TACTTCCATA 18 846 0 
AAGTCTTCAC TCGCATTTTT TGGTGGACGA GGGCGTCCGC CATAGGTTTG ATCGCCTAGC 188S20 
AGTGGATGGG CAATATGTGC CATATGAACG CGGATTTGAT GGGTGCGCCC CGTTTCTAAA 188 580 
CGTAAACGTA AACGGGTGTA GTTACGGTAA TTTTCCATAA TACGATAGTG GGTAACAGCG 18864 0 
GGTTTTCCCA TTGGATGAAC CGCCATTAAA GTACGTTTGG TTGCATGACG AGCCATAGGT 188700 
TGATCTACTG TGCCACCTTT AGTCATAATG CCTGAAGCGA CTGCTTCATA TTCGCGCGTA 188 76 0 
ATTTTACGTT TTTGTAGGTC ACGTACTAAC TTAGTTTGAG C AG GAATGGT TTTAGCaACA 18882 0 
ACCATTAGAC CTGTTGTATC TTTATCTAAT CGATG C AC AA TTCCTGCTCT CGGTACTTCT 188 88 0 
ACAATCGGCG GATAATGATA GAGTAACGCA TTGAGCACGG TTCCATTTGG ATTGCCAGCA 188 94 0 
CCAGGGTGTA CCACAAGATC TTTTGGTTTG TTAATCACAA TAATATCATC ATCTTCATAC 1890 0 0 
ACGATATTCA ATGGAATATT TTCTGCTTCA AAACGTGTTT CATCTTCCAC TTCAACAATA 18 906 0 
ATCTCTATTC TCTCTCCTCC AAGCACTTTC TCACGCGGAA TATTAGTAAT GCGGTCGTTA 18 9120 
AGTTTTACGA GATCGGCTTC AATCCATGTT TTTAATCGGG AACGAGAATA TTCAGGGAAC 18 9180 
AACTCTGCAA GTGTTTGGTC TAAACGTTGT CCCATTTGTT CTGGCTGCAC TTCAGCCGAA 18 9240 
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AGGGTAATTT 


GTGGCATAAA 


TAGCTCTCAA 


AAATTTTAAA 


AGTTGGCTTA 


TTGATTAATG 


189300 


i 1 LTATACAA 


TAGCGGACAA 


TTGTATCTTA 


TTTATTGGTT 


TTCGTAAAAA 


TTAAGGAAAA 


189360 


AACAATGCGT 


AAAATAAAAT 


CTTTGGCATT 


GCTTGCCGTG 


GCAGCATTGG 


TTATTGGATG 


189420 


TTCAAGTGGT 


TCAAAAGATG 


TTGAACAAGC 


ATCGGTAAAT 


GAATTATACA 


CAAAAGGTAC 


189480 


AACATCATTA 


CAAGAAGGTA 


GTTACTCTGA 


AGCCATTCGC 


TATTTAAAAG 


CGACGACTGA 


189540 


GCGTTTTCCA 


GGT AG CGTTT 


ATCAAGAACA 


AG CAATGTTA 


GATTTAATTT 


ATGCAAATTA 


189600 


TAAAACACAA 


GATTATACGC 


AAGTATTATT 


GATGGTAGAT 


AGCTTTTTAC 


ATCAATTTAC 


189660 


ACAAAGCCCA 


AATCAAGCCT 


ATGCGGTATA 


TATGGCTGGC 


TTAACTAATG 


CAGCAACAGG 


189720 


CGATAATTTC 


ATTCAAGATT 


TTTTTGGAAT 


TGACCGTGCT 


ACACGTGAAA 


CAACCTCTAT 


189780 


GCGTACCGCT 


TTTTCTAATT 


TC C AAAACTT 


AGTTCGTGTA 


TTCCCTAATA 


GCCCTTACTC 


189840 


ACAAGATGCT 


TTAGCTCGTA 


TGGCTTATAT 


TAAAGATGCA 


CTGGCTCGTC 


ACGAATTAGA 


189900 


GATTGCAAAA 


TTCTATGCAA 


AACGTAAAGC 


GTGGGTAGCA 


GTTGCAAATC 


GTGTGGTAGG 


189960 


AATGTTAAAA 


CAATATC CTG 


ATACTAAAGC 


aACTTATGAA 


GGATTATTTC 


TGATGCAGGA 


190020 


AG CTTATGAA 


AAAATGGGTT 


TAACCGCATT 


AG CAAATG AT 


ACTCAAAAAA 


TTATTGATGC 


190080 


GAATAAAGAT 


AAAACTTTTG 


CACCAATTGA 


AAAACCAAAC 


GAGCCAGACT 


TAAAAGTGCC 


190140 


AGCAGTAAAA 


TAGCAATAAC 


TGCAGAAAAA 


TCGAATACTC 


TCTGCACCCT 


AAAAATTGAA 


190200 


TAAAATTTAA 


ACTAATTAAG 


GTGCAGATTT 


TTTTATTTTT 


TATCAACTGA 


AAATGCTTTA 


190260 


CCATAATTTG 


CACAAATAAT 


CTGAGCAGCA 


TTATAAAGCG 


TTGTATCAGG 


TGTTAAACTT 


190320 


AACGTATGTT 


TCTTTTGCTT 


TTTAGGTGCT 


GCCCGCAAAC 


CCTGTCCCCA 


AAATTCATCA 


190380 


TAGAAATCAG 


TTCGTATTTG 


AGTTAAATGA 


TAATTTG CAC 


AATTCAAAAT 


CTTATACTGA 


190440 


CGAACAGAAC 


GTGCATAACG 


TTTAGGCTCA 


GGATAAACAT 


ACAATCCCCT 


ATCTAAATTC 


190500 


ACCACAGCAT 


CAAAATGTAC 


AATTTGTGGC 


TCTTGGTTAT 


CCACCCAGAT 


CGATTCACTA 


190560 


TCGATGTAAT 


AATTCACATT 


CTTTACCAAA 


CGTATATATC 


CGCTTCGTAC 


ATCAGTCGGC 


190620 


GGTGCCAGCT 


TCACATCATT 


TTGTTCAGCC 


TTTTGGATTT 


GAGCAGAACA 


AGCGGTAAGT 


190680 


AACCCAAGTG 


ATAATGTTAA 


AATAATTTTT 


TTCATTATTT 


ACTCCTTATG 


TTAGCTGACA 


190740 


TTTATTCTAG 


AATTTTACAG 


TGTGTCCATA 


ACCTTCTAGG 


ATTGTTTTAA 


TATGTTCTAA 


190800 


GGATTCTTTA 


GTTGGCGGTA 


ATACATTTTC 


AAG CTCATAA 


TCTAACCCAA 


GGGTTTTCCA 


190860 


TTTATGCACA 


CCTAATCGAT 


GATAAGGAAG 


AAGTTCAACT 


TTTT CAATAT 


TGGTCATACC 


190920 
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TTCAATAAAC TGACCTAATA AATGCACATC GTGATCGCTA TCAGTATAAC CAGGAACCAC 190980 
AACATAACGA ATCCAGGTAT GTTGATTACG TTTTTGCAAA TATTTTGCAA ATTCAAGGGT 191040 
ACGTTTATTT GGCACCCCAA TAAGATTTTG ATGAACTTGA TCATTAAGTT CTTTTAAATC 191100 
AAGTAAAACA AGATCTGTTA CATCTAATAA TTCATCAATA ATATGATCAT AATGACGTAC 191160 
AAAACCATTT GTATCTAAGC AAGTATTAAT CCCTTCCTCT TTACAAGCAC GGAACCAATC 191220 
GCGTACAAAC TCTG CTTGTA ACACAGCCTC G CC AC C AG AT GCTGTGACAC CACCGCCAGT 191280 
AGCATTCATA AAATGGCGAT AAGTCACGAC TTCTTTCATT AAATCTTCGA CACTGATTTC 191340 
TTTACCACCT TCAAGATCCC AAGTATCACG ATTGTGGCAA TATTTGCAGC GCATCAAGCA 191400 
GCCTTGCATA AATAAAATAA AACGAATACC TGGCCCATCT ACAGTGCCAC AGGATTCAAA 191460 
AGAGTGAATT CGTCCAAGAA CTGACATACA TAAATTTCCT AAAATTAGAT AT AG CTAAAT 191520 
TTTACCAAAG ATTTAGTCTA TAAAAAATGA GCCTGATTTA AATCAGGCTC ATACATATTA 191580 
TGTTCCCGCC AAAGTGCGGT CAAAAATAAA CATGTTTTTA CATTGACTCT GTGAAAGTTC 191640 
TAGTAATTAC GTCTTGTTGT TGCTCTTTAG TTAAAGAGTT AAAACGTACT GCGTAACCTG 191700 
ATACACGGAT AGTCAATTGT GGATATTTAT CCGGATTTTC CATTGCATCT AATAACATTT 191760 
CACGGTTTAA CACGTTCACA TTTAAGTGTT GGCCACCTTC AACCGTTGCT TCGTGGTGGA 191820 
AGTAACCATC CATTAAGCCA GCAAGGTTGC GACGTTGTGC TTCTGCATCT TTACCTAAGG 191880 
CATTTGGTAC GATTGAGAAG GTATAAGAAA TACCATCTTT CGCGTAAGCA AATGGAAGTT 191940 
TAGCTACAGA AGTTAATGAT GCTACCGCAC CTTTTTGGTC ACGACCGTGC ATTGGGTTCG 192000 
CACCTGGTCC G AATGGTG CA CCAGCACGAC GACCATCAGG GGTGTTACCC G TTTT CTT AC 192060 
CATAAACTAC GTTAGAAGTA ATGGTTAATA CAGATTGTGT AGGCACTGCA TTGCGGTAAG 192120 
TTTTAAGTTT TTGAATTTTC TTCATAAAAC GTTCAACTAA GTCACAAGCG ATGTCATCAA 192180 
CACGGTTATC ATTGTTACCA TATTGTGGAT ATTCACCTTC GATTTCAAAG TCGATTGCTA 19224 0 
CGTTAGTTGC AACAACATTG CCATCTTTAT CTTTGATGTC GCCACGAACT GGTTTAACTT 19-00 
TCGCATATTT GATTGCTGAA AGTGAGTCAG CCGCAACAGA AAGACCTGCG ATACCACAAG 192 360 
CCATAGTACG GTATACATCA CGATCATGTA ATGCCATTAA TGCGGCTTCG TATGAATATT 192420 
TATCGTGCAT ATAGTGGATT ACGTTTAAGG CAGTCACATA TTGTTTTGCC AACCAATCCA 192480 
TAAAGCTATC CATACGAGTC ATTACTGTAT CGAAATCTAA T ACTT CAT C A GTAATTGGTG 192 54 0 
CAGTTTTCGG ACCTACTTGC ATACCTAATT TTTCATCGAT ACCGCCGTTG ATTGCGTATA 192600 
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ACAATGTTTT CGCTAAGTTT GCACGTGCAC CGAAGAATTG CATTTGTTTA CCCACAATCA 19266 0 
TTGGTGATAC ACAACATGCG ATTGCGTAGT CATCGTTGTT GAAGTCTGGA CGCATTAAAT 192 72 0 
CATCGTTTTC GTATTGAACT GATGAGGTAT CAATCGATAC TTTTGCACAG AAACGTTTGA 192780 
AGTTTTCAGG TAATTGTTCA GACCAAAGAA TGGTTAAGTT TGGCTCTGGA GAAGTACCCA 192840 
TGTTGTAAAG GGTGTGTAAA ATACGGAATG TATTTTTGGT TACTAATGTA CGACCATCTA 192 900 
AACCCATACC TGCGATGGTT TCAGTTGCCC ACATTGGGTC AC CAGAG AAT AATTGATCGT 192 960 
ATTCAGGTGT ACGTAAGAAA CGAACCATAC GAAGTTTCAT AACTAAGTGG TCAACTAATT 193 020 
CTTGCGCTTC AGTTTCAGTA ATTTTTCCTG CTTTTAAATC ACGTTCGATG TACACGTCAA 193080 
TAAAGGTTGC GGTACGACCG AATGACATTG CAGCACCATT TTGTGATTTT ATTGCAGCAA 193140 
GATAAGCAAA GTACATCCAT TGAATGGCTT CTTGAGCATT AGTTGCTGGG TTAGAAATAT 193200 
CATAACCATA GCTTGCTGCC ATTTGTTTTA ATTGACCTAA TGCACGGTGT TGTTCTGCGA 193260 
TTTCTTCACG TAAACGAATT GTTGCTTCAA GATTTACGCC ATCTTCTAAA TCTTTTTGTA 193320 
AAGAAGAGAA TTGTGCGTAT TTATCTTTCA TTAAGAAATC TACAC CATAA AGTGCTACAC 193380 
GACGGTAGTC ACCGATGATA CGACCACGAC C ATAAG CATC TGGAAGACCA GTTAATACCC 19344 0 
CAGATTTACG GCAACGTAAA ATATCTGGCG TGTAAACATC GAATACACCT TGGTTATGTG 193 5 00 
TTTTACGGTA TTCAGTGAAG ATTTTTTTCA CTTTTGGATC AAGTTCACGA CCATAAACTT 193S60 
TACAAGAACC TTCCACCATT TTGATACCAC CGAATGGCAT AATGGCACGT TTTAAAGGTT 193620 
CATCAGTTTG AAGACCAACG ATTTTTTCTA AATCTTTGTT AATGTAACCA GGTGCGTGAG 193680 
AGATAATGGT AGATGGTGTA TGTTCATCAA AATCTAATGG CGCGTGAGTA CGGTTTTCAA 193740 
TTTTAATACC TTC CATCACA GATTCCCAAA GCTTGGTTGT TGCTTCGGTT GGAC CTGCT A 193800 
AGAAAGAGTC ATCGCCTTCA TAAGGGGTAT AGTTTTTTTG GATAAAGTCA CGTACATTGA 193860 
CATTTTCTTG CCAATCGCCA CCAGCAAAAC CAGCCCACGC CAATTTTTGC ATTTCATTAA 193 920 
GTTCTGACAT AGTCATTTCC TTTGTTAATT AATAAATAAA TCTTTAATGT GTTTTGGTTA 193 980 
AATAACGTTG GAATACACCA ATACAAATTG CCCCGCCGAC AATATTTCCC AACG TT ACAG 194040 
GGATTAAATT CTTCACGATG AAATGGTACA AATCTAAATC TGCGTATTTC ATCGGATCAA 194100 
CACCGATTTG TTGCCAAAAT TCAGGTGTAC TAAAATGTGC GGTAATAATT CCCATCGGAA 194160 
TCATAAACAT ATTAGCCACA CAGTGTTCAA ATCCTGACGC GACAAACAAA CCAATCGGCA 194220 
TAATCATAAT AAATGCTTTG TCTGTAACAG TCTTACCAGA ATAAGACATC CAAACCGCCA 194 2 80 
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CACACACCAT AATGTTACAT AAAATACCTA GGTTAAAAGC TTCAAACCAA GTGTGATGAA 194 34 0 

TTTTATGTTG GGCGGTTGCA AGAATTGTTA AACCCCATTG GCCATTTGCT GCCATTGTTT 1944 00 

GTCCACTAAA CCAAATAACA GCCGCAATAA AT AAAC C AC C AACGAAGTTG CCTAAATACA 19446 0 

CCACGATCCA ATTTCGAATC ATTTGTGTCG TGGTGATTCT ACCGCCTACA CGGGCGACTA 194 52 0 

AAGTTAAAGT AGAAGAAGTG AATAATTCAG ATCCTAAAAT CACCACCATA ATTACGCCCA 194 580 

AGGAGAAGAC TAATCCACCT ACTAACTTGG TTAATCCCCA AGGTGCGCCT G C ACT AG CAG 194640 

TTTGTGTTGT CGTGTAAAAT ACAAAGGCTA AGGCAATACA AGCACCGGCA CTAATACCGG 194 700 

ACATAAACGA TAAAAATGGA CGTTTATTTG CTTTATAAGT TGCTGTATTT TCTGCATAAT 194 76 0 

CCGTTGCTTC AACAGGTGTG AGCGCAACAG AAGATAGTTT TTGATTTTCT GATGCCATAA 194 820 

TCAAACTCCA ACTATTGACA TTGAAAAAGT TGATGAAAAT TTAATACTTT TAACAAGTTT 194 88 0 

ACGACTACAA ATACTTCTAT AATTGGTGCT ATTTTACTCT CCTCCCCTTT CATTGCAAAG 194 94 0 

AAAAGCAAAA ATTATTTCAA CAAACTTTGA TATTTTGCAA GGTTTGTTTA ATTAAAGACA 195 000 

AAATTCCCAT ATTAATATTC GCTCAATTTC ATACATTCAG AGTAAGTAAA GTACTTAATT 195060 

TGAAACTTGT TTTATTCGCG TAAT AC CGTG AATTCCCCCA GATAATCATC CACTTTTCCA 19512 0 

AAACCTTGAT TATTCACAAC AGGCAAATTC ATAATTTTTT CAAACGAACC AGATTAAGTG 195180 

TATAGACTTG CTATTTTGAG TAATTAATAG CATTATTTTG ATCTTTTTGA AGAAAAAGGC 19 5240 

GATCTTTGTC ATAAAATAAT GATTTAAGTG CGGTTAAAAT TTGGTTAATT TTAAGTGCAC 195300 

TTCTTTATAC AGGAGCAGAT TATGTATTAT GGTTTAGATA TTGGTGGCAC GAAAATAGAG 195360 

CTTGCCGTAT TCAATGAAAA ATTGGAAAAA TTATATTCTG AACGGGTGCC AACAC CT AAA 19542 0 

ACCGACTATG AAGAATGGCT TAACACGATT GTGGATTTAG TGAATCGTGC TGATGAAAAG 19 5480 

TTTGGTGAGG TGGGCACCGT AGGTTTGGGC GTACCTGGTT TTGTTAATCA ACAAACTGGA 195 54 0 

TTAGCCGAAA TTGCCAATAT TCGTGTGGCA GATAATAAAC CAATTTTATG CGATTTGTCT 195600 

ACGCGTTTGG GCCGTGAAGT GCGGGCAGAA AATGACGCAA ATTGTTTTGC GTTATCTGAA 19 5660 

GCATGGGATA CGGAAAATCA GCAATATTCG ACTGTACTTG GTTTAATTCT TGGGACTGGT 19572 0 

TTTGGTGGTG GCTTTGTCTT AAAC GGTAAA GTTCATTCAG GTCAAGTGGG AATGGCGGGA 195 78 0 

GAGCTTGGTC ATTTACAATT AAATTATCAT GCTTTGAAAT TATTGGGATG GGATAACGCC 19584 0 

CCAATTTATC AATGCGGTTG CGGCAACAAG GCTTGTTTGG ATAATTATCT GTCTGGTCGC 19590 0 

GGTTTTGAAA TGCTTTATCA AG AT TTGAAA GGCGAAACAT TATCTGCTCG TAAAATTATT 19596 0 

-77.118- 

SUBSTTTUTE SHEET (RULE 26) 



BNSOOCIO <WO_9833276A1 J^> 



WO 96/33276 

PCT/US96/05320 

AATTTGTTCT ATCAAAGCAA TGAAAGTGCG GTAGATTTTG TGAATCTTTT TGTTGAGTTG 196020 
GCGGCGATTT CTATTGGCAA TATCATTACG GCATTTGATC CGCATATGAT TGTGTTAGGT 196080 
GGTGGATTAT CTAATTTTGA TTACCTTTAT GAGGCTTTAC CAAAAGCnCT TCCGCCTCAT 19614 0 
TTGATGCGAA AAGCAAAAGT TCCGCCAATT AAAAAAGCGA AACACGGCGA TTCTGGTGGT l 96200 
GTGCGTGGTG CGGCAGCGTT GTTTTTAACC AAATAATTCA ACAAACTAGT GGGGCGTATT l 96260 
CAAATAATAC GCCCTTTATT TTGGAAAATA TTTAGAATAT TGTGAATTTA TTAGATTTTT l 96320 
TCTTGTTTGT GTTTCTTTCT TTAGAGAAAT TATTAAAAAA TCTCGCCTTT TCATTGAAAA 196380 
GCCATACAAT CAAATTATTT ATCGCTTGAG GGATGATTAT GGAATTTGAA TTTTCAAAAA 196440 
TGTTAGAAGA AGTGCTAACT TGGATAGTCG CACACCTTGA TGGACCTTTA TGGGATGCCA 1 965 00 
CCATTATTAT TTTGCTTGGG ACTGGTCTAT TTTTTACCAT TACAACAGGA TTTGTGCAGT 196560 
TCCGTTTATT CCCAGCAAGC CTTCGTGAAA TGTGGTTTGG TCGTTCGGTG GAGGGGAGTT 196620 
CATTAACACC TTTCCAAGCG TTTACAACAG GTCTTGCGAG CCGCGTTGGT GTGGGTAACA 196680 
TTGGTGGGGT TGCAACGGCA ATCGCCTTAG GGGGCGAAGG CGCAGTGTTT TGGATGTGGG 196740 
TAACGGCATT TATTGGTATG TCGAGTGCTT TCGCTGAATC TACCCTTGCT CAATTATTTA 196800 
AAATTCAAGA TAAAGATGGA TCATTCCGTG GCGGCCCTGC TTATTATATT GTGCAAGGTT !96860 
TAAAATCACG TTG TATGGCA GTGGCTTTTG CGCTTGCATT AATTTTTACA TTTGGTTTTG 196920 
CCTTTAATTC TGTGCAGGCA AACTCTATTG TTGAAGCAAC CAGCAATGCG TGGAATTGGA 196980 
AAGGGGAATA TGTCGGTATT TCATTAGTAA TTTTTACCGC ACTTATTATT TTCGGTGGCG 197040 
TTAAGCGTAT TGCGATTATT TCGAGCAACC TTGTACCAAT GATGGCACTT TTCTATTTGA 197100 
TTATGGCGGT AATTATTCTT GGCATGCATA TTGATATGAT CCCTTCCGTG ATTCATCGTA 197160 
TTGTTCAAAG TGCATTTAGT TTTGATGCTG CCGCTGGCGG AATGTTTGGT GCATTGGTAT 197 220 
CGAAAGCAAT GATGATGGGG ATTAAACGTG GTTTATTCTC AAACGAAGCA GGGATGGGAT 197280 
CTGCGCCTAA TTCGGCTGCA GCAGCTCACG TTAAGCATCC AGTTAGCCAA GGTTTAGTG C 197340 
AAATGCTCGG GGTGTTTGTT GATACAATGA TCGTTTGTAC TTGTACTGCC GTTATTATTT 197400 
TGCTTTCGAA TAATTATGGT AGCGAAACGC TCAAAAGTAT CTCTCTTACG CAAAATGCTT 1974 60 
TGCAATACCA CATAGGTGAA TTTGGGGCGC ATTTCCTGGC GTTTATCTTA TTGTTATTCG 197 520 
CTTATTCTTC TATTATTGGT AACTATGCTT ATGCGGAAAG CAACATCCGT TTTATCAAGA 197580 
ATAAACCTTG GTTGGTCTTG TTGTTCCGTT TAATGGTGCT ATTTTTCGTG TATTTCGGTG 19764 0 
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CGGTTCGCTC 


TGGTAATGTG 


GTGTGGAATT 


TC G C AG AT AC 


GGTGATGGCT 


GTCATGGCAA 


197700 


TCATTAACTT 


GATCGCAATT 


TTGATGTTGT 


CGCCAATCGT 


ATGGAAATTA 


ATGAAAGATT 


197760 


ATCAACGCCA 


GCTTAAAGAA 


GGAAAAACGC 


CAGAGTTTAA 


AATTGATGAA 


TACCCTGAAT 


197820 


TACGTAAGAA 


AATATTTGAT 


TCCCGCATTT 


GGAAATAAAG 


ATAAAATGGC 


TAGAAAGAGA 


197880 


TTATCTACCT 


TTCCAAAAAG 


AGTTAGATTA 


TTATTTAAAG 


AAATTTTTTG 


ATATTGTTTC 


197940 


AGGCTTGGTT 


CTTCTAGTTC 


AGACTGAATA 


AACTATTGCC 


GGCACTTTTT 


ACCGCCCTCG 


198000 


TGCCGGCTAT 


CCCAAACTGC 


TTTGAAAACT 


GACCGCTCTT 


TAGCTTACTT 


CAAAAATTCC 


198060 


GCATGATAGG 


CAATATGCTC 


GCCAATAAAA 


CTGGCGATGA 


AGTAATAGCT 


GTGATCATAG 


198120 


CCTTTATGGA 


AGCGCACATC 


GACTGGCTGA 


TTTGCCACTC 


GACAGGTTTC 


TATAAAATCT 


198180 


TCGGTACGCA 


ATTGTGTCGG 


TAAAAACTCA 


TCTTCCAAGC 


CCTGATCAAT 


GCGCATACCT 


198240 


TGCACTTTAT 


AGCCTTGTTG 


AATG AG C G AG 


CTGGCATCAT 


ATTGCTGCCA 


TTTTTCACGA 


198300 


TCTTCCCCTA 


AATAGGCAGA 


AAAGGCTTTT 


TCTCCCCAAG 


GCACAAGGCT 


TGGCGACAAA 


198360 


ATGGG CG AAA 


AGGCAGAAAC 


GCTTTGATAA 


CGTTCCCGAT 


TTCGCAGTGC 


CAATACCAAT 


198420 


GCACCGTGTC 


CACCCATTGA 


ATGTC C CAT A 


ATGGAACGTT 


TGCCGTTGGT 


AGGGAAATTT 


198480 


GCTTCAATCA 


AATCAGGCAA 


TTCATTCAAG 


ATATAATCAT 


ACATTTGATA 


ATTCGTCGCC 


198540 


CAAGGCTGCT 


CGGTCGCATT 


AAGATAAAAG 


CCCGCTCCCT 


GCCCTAAATC 


GTAAGCCGCA 


198600 


TCATTCGGCA 


CTTGCTCTCC 


ACGAGGGCTT 


GTATCGGGAG 


CAACAACAAT 


CACTTGATGT 


198660 


TCTGCCGCGT 


AACGCTGGAA 


GCCTGATTTG 


GTAATGAAAT 


TTTG CTCAGT 


ACAAGTTAAG 


198720 


CCTGAAAGCC 


AATAAATCAC 


ACCAAGCGGT 


CGATTTTCTG 


GATTATTTGG 


CAAATAAACG 


198780 


GCAAATTTCA 


TTTCACATTG 


AAGTGTTTGA 


GCATTATGCG 


CCCAAACTTG 


CTGCGAACCG 


198840 


CCAAAAATTT 


GATGTTGTTC 


AATTAGTTTC 


ATAGCGTACC 


TTAG TAGTG A 


ATAAC AG T AC 


198900 


GGATTGATTT 


ACCTTCGTGC 


ATTAATTCAA 


AGGCTTCATT 


AATTTTATCA 


AGTGGCATTG 


198960 


TGTGAGTCAC 


AAACGGTTCT 


AACTGGATGT 


CGCCTTTCAT 


TGAATCTTCC 


ACCATTTGCG 


199020 


GAAGTTCAGA 


GCGACCTTTC 


ACGCCACCAA 


ATGCCGAACC 


TTTCCAAACA 


CGGCCTGTTA 


199080 


CCAATTGGAA 


CGGACGCGTT 


GAAATTTCTT 


GTCCTGCACC 


CGCTACGCCG 


ATGATAATGG 


199140 


ATTGTCCCCA 


GCCACGGTGA 


GCACTTTCTA 


ATGCTTGACG 


CATTACGTTT 


ACATTACCGA 


199200 


TACATTCAAA 


GGTATGGTCA 


ATGCCCCATT 


TATTAATATC 


TAACAACACA 


TCTTTGATCG 


199260 


GCTTATCGTA 


ATCATTAGGG 


TTTAAACAAT 


CCGTTGCACC 


AAACTGTTTT 


GCCAACTCAA 


199320 
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ATTTTGCAGG 


ATTGGTATCA 


ATAGCGATAA 


TGCGACCTGC 


TTTGGCTTGA 


CGCGCACCTT 


199380 


GCACTACCGC 


TAAACCAATC 


GCACCCAAGC 


CAAACACGGC 


AACAGAG TCG 


CCTTCTTGTA 


199440 


CTTTTGCTGT 


ATTATGCACC 


GCACCAATAC 


CTGTAGTGAC 


ACCGCAGCCG 


AG C AAAC AC A 


199500 


CTTGTTCGTG 


GTTCGCTTCA 


GGGTTGATTT 


TCGCCAGTGA 


AACTTCCGCA 


ACAACTGAGT 


199560 


ATTCACTGAA 


GGTTGAACAG 


CCCATATAGT 


GATAGATCGG 


CTGACCTTGA 


TAAGAAAAAC 


199620 


GTGTCGTGCA 


ATCAGGCATT 


AAGCCTTTAC 


CTTGCGTGTC 


ACGCACTGAG 


ACGCATAAGT 


199680 


TGGTTTT AC C 


CGAGCGGCAA 


AACTCACATT 


CGCCACATTC 


AGCTGTATAA 


AGAGGAATGA 


199740 


CGTGATCGCC 


CGGTTTTACG 


CTTAACACAC 


CTTCGCCAAC 


CGCAACAACC 


ACACCCGCAC 


199800 


CTTCGTGTCC 


TAGCACCACA 


GGGAATACCC 


CTTCAGGATC 


GCTTCCTGAT 


AACGTAAACG 


199860 


CATCAGTATG 


GCAAACGCCC 


GTGTGAGTGT 


TGCGGATTAA 


CACTTCACCT 


TTGCGTGGCA 


199920 


TTTCTACGTC 


GATTTCTACA 


ATTTGTAAGG 


G TTG ATTTGG 


GGCGAATGCT 


ACCGCCGCAC 


199980 


GAGATTTGAT 


GGTTGAATTG 


ATTTGTTTAA 


TTTCCATTGT 


TTTTCCTCTT 


AATTCATTGA 


200040 


CAAAGTTGCC 


TTACGTAAGC 


AAGATAGCAA 


CAGTTTACAC 


TTTAGAGTTC 


ACTCTAAAGC 


200100 


AAGTGTTTTT 


TTATCATTTT 


TGTTTAAAGG 


GAATAAAAAG 


TGACTTACAC 


CACCGCTAAA 


200160 


GCTGCTGAAA 


AAATAGGCAT 


TTCCGCCTAC 


ACCTTACGCT 


TTTATGACAA 


AGAAGGCTTG 


200220 


TTACCTAATG 


TCGGACGTGA 


TGAATACGGT 


AATCGCCGTT 


TTACCGACAA 


GGATTTG CAA 


200280 


TGGTTGAGTT 


TATTACAATG 


CTTGAAAAAT 


ACAGGAATGA 


GCTTAAAAGA 


CATCAAACGC 


200340 


TTTGCGGAAT 


G TAC CAT CAT 


TGGCGACGAT 


ACTATTGAAG 


AACGCCTTTC 


CTTATTTGAA 


200400 


AATCAAACAA 


AAAATGTAAA 


GTGTCAAATT 


GCCGAATTAA 


AACGTTATTT 


AGATTTGCTT 


200460 


GAATACAAAT 


TAGCGTTTTA 


CCAAAAAGCG 


AAAGCACTAG 


GCTCGGTAAA 


AGCTGTGAAT 


200520 


TTACCGCAAA 


TTCCTGAAAC 


GAGCTAAAAG 


GGTAAAAGTG 


TTCACATCAC 


AGAATTAGAA 


200580 


GAAAAATACA 


ATTTTTAGAG 


ATAGATAAAT 


TTTAACTTTT 


TTATTACATA 


AATAAGTTAA 


200640 


AAATCAGGTT 


TTGTGATCAA 


GATCAGGATA 


ACATAAAAAT 


AATCTCCGAT 


AGCTATTGTT 


200700 


CTAAATGGCT 


AAAAAGAGTA 


TATTCCGAGC 


CAAATTTTTT 


CTTTTTTACA 


GAACGGAGTT 


200760 


TATTATGTTT 


GGTTTATCCC 


CCGCTCAACT 


GATTATTTTA 


TTAGTCGTTA 


TTTTATTGAT 


200820 


TTTTGGCACG 


aAAAAATTAA 


GAAATGCTGG 


CTCAGATCTT 


GGTGCAGCAG 


TGAAAGGTTT 


200880 


CAAAAAGGCG 


ATGAAAGAAG 


ATGAAAAAGT 


AAAAGATGCG 


GAATTTAAGT 


CTATAGATAA 


200940 


TGAAACAGCG 


TCTG CAAAAA 


AAGGAAAATA 


TAAAAGAGAA 


AGGAACAGGC 


TTAATCCGTG 


201000 
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TTTGATATTG 


GTTTTTCAGA 


ACTTATTTTA 


CTAATGGTTT 


TGGGATTGGT 


TGTATTAGGC 


201060 


CCGAAACGTT 


TGCCTATAGC 


AATTCGCACT 


GTAATGGATT 


GGGTAAAAAC 


AATTCGTGGT 


201120 


TTAGCTGCTA 


ACGTTCAAAA 


TGAGTTGAAA 


CAAGAGCTAA 


AATTGCAAGA 


ATTACAAGAT 


201180 


AGTATTAAAA 


AAGCGGAATC 


CCTTAACTTG 


CAAGCCCTTT 


CACCAGAATT 


GAGTAAAACG 


201240 


GTGGAAGAAT 


TAAAAGCTCA 


AG C AG AT AAG 


ATGAAAG CGG 


AGCTAGAAGA 


TAAAGCAGCT 


201300 


CAAGCTGGTA 


CAACGGTGGA 


AG AT C AAATC 


AAAGAAATCA 


AAAGTGCGGC 


TGAAAATGCA 


201360 


GAAAAGTCTC 


AAAATGCAAT 


CTCAGTTGAG 


GAGGCTGCAG 


AAACTTTATC 


TGAAGCAGAA 


201420 


AG AACAC CTA 


CAGATTTGAC 


CGCACTTGAA 


ACTCACGAGA 


AAGTAGAACT 


GAACACACAT 


201480 


TTATCTAG CT 


ATTATCCTCC 


TGATGATATT 


GAAATCG CTC 


CAGCGTCAAA 


ATCTCAATCT 


201540 


TCAAAAACTA 


AATCATAGCG 


AGTTTTTATG 


AGTAATGTGG 


ATGAATCTCA 


ACCTTTAATT 


201600 


ACGCACCTTG 


TTGAACTAAG 


AAACCGTTTA 


TTACGCTGTG 


TGATTTGTGT 


AGTGTTGGTT 


201660 


TTTGTTGCTT 


TGGTGTATTT 


TTCCAATGAT 


ATTTATCATT 


TTGTTGCTGC 


ACCTTTAACG 


201720 


GCTGTTATGC 


CAAAAGGCGC 


AACAATGATT 


GCAACCAATA 


TTCAAACGCC 


TTTCTTTACG 


201780 


CCAATTAAAT 


TAACTGCGAT 


TGTTG CAATT 


TTTATTTCTG 


TCCCTTATTT 


GCTTTATCAA 


201840 


ATTTGGGCTT 


TTATTG CACC 


TGCTTTATAT 


CAGCACGAAA 


AACGGATGAT 


TTATCCGTTG 


201900 


TTATTTTCCA 


GTACGATTTT 


ATTTTATTGC 


GGTGTTGCTT 


TTGCTTACTA 


TATCGTCTTT 


201960 


CCTCTTGTAT 


TCAGCTTCTT 


TACACAAACA 


GCCCCAGAAG 


GCGTTACCAT 


CGCAACAGAT 


202020 


ATTAGTAGTT 


ATTTAGATTT 


TGCTTTAGCG 


TTATTTTTAG 


CTTTTGGCGT 


TTGTTTTGAA 


202080 


GTGCCAATTG 


CGATTATTCT 


ACTTTGCTGG 


ACTGGTATTA 


CCACTGTAAA 


GGCTCTTTCC 


202140 


GAAAAACGCC 


C TT AT ATT AT 


TGTTGCGGCA 


TTTTTTATTG 


GTATGCTTTT 


AACGCCTCCT 


202200 


GATGTTTTTT 


CACAAACTTT 


GCTTGCCATA 


CCGATGTGCT 


TGCTGTTTGA 


GCTCGGTCTA 


202260 


TTGGTCGCTC 


GATTTTATCA 


ACCAAAAGAC 


GACGAAAGTG 


CGGTTAAAAA 


TAATGATGAA 


202320 


TCAGAAAAAA 


CACAATGATG 


AACTGAATTG 


TTAATCGTAT 


GGGGGCTTTG 


CCCCCGTTTT 


202380 


TATTATATTG 


ATATAACCCT 


AATTTTGTTT 


TTTCATAAAA 


CATATTTTCG 


TAGTGATTTA 


202440 


CTAGAAATAT 


GATTTGTAAT 


CTGCTCTATA 


TTTTTATTGA 


AAATATCATT 






TTTCTGTCGT 


TTTTTATTTT 


TTATCTAATA 


AATACGATAT 


TTTTTAGATA 


ATTTCTTTAC 


202560 


TTTTTCCTTT 


TCTTTAAATT 


TCCTTTATAA 


AAATGGTTGC 


AAGATTAAAT 


TAGCCATTGC 


202620 


ATATTG TC T A 


TGCGCCTCAC 


GAGGCGAATA 


CAATAAAAAT 


ACAATACACA 


CAACATCAAC 


202680 



-77.122- 

SUBSTTTUTE SHEET (RULE 26) 

BNSOOCID: <WO 9633276A1 J > 



WO 96/33276 PCI7US96/0S320 

AAAGAGGGAA ATTCATTATG TCAAAAGTTG CTTCCTTAGA CGCATTTTTA ACAAAAGTTG 2 02 74 0 

CTCAACGCGA TGGTTATCAA CCTGAATTTT TACAAGCGGT TCGCGAGGTA TTCACATCAA 2 028 00 

TTTGGCCTTT TTTAGAAGCC AATC CTAAAT ATCGTTCAGA AGCATTATTA GAACGTTTAG 2 02B6 0 

TTGAGCCTGA ACGTGCATTT CAATTCCGTG TGGCGTGGAC TGACGATAAA GGGCAAGTGC 2 02 92 0 

AAGTAAACAG AGCATTTCGT GTACAATTTA ATAGTGCCAT AGGCCCATTT AAAGGGGGAA 2 02 980 

TGCGTTTCCA TCCATCAGTA AATTTATCTA TCTTAAAATT CTTAGGTTTT GAGCAAATCT 2 0304 0 

TTAAAAATGC TTTAACAACA TTGCCTATGG GCGGGGCAAA AGGCGGTTCA GATTTTGATC 2 03100 

CTAAAGGCAA ATCTGATGCT GAAGTTATGC GTTTTTGCCA AGCATTAATG GCTGAACTTT 2 03160 

ATCGTCACGT AGGAGCTGAT ACAGATGTTC CCGCAGGCGA TATAGGCGTC GGTGGGCGCG 2 0 3220 

AAGTTGGCTA TTTAGCTGGC TATATGAAAA AATTATCAAA CCAATCAGCC TGTGTTTTCA 2 03280 

CTGGTCGCGG TCTTTCTTTC GGTGGTAGTT TAATTCGTCC GGAAGCAACG GGATATGGAT 2 03 34 0 

TAATTTATTT TGCTCAAGCA ATGCTTGCTG AAAAAGGCGA TAGTTTTGCA GGTAAAGTAG 2 03 4 00 

TTTCAGTTTC TGGTTCTGGT AATGTAGCAC AATATG CTAT TGAAAAAGCA TTATCTCTTG 2 03460 

GTGCAAAAGT AGTAACTTGT TCTGATTCAT CAGGTTATGT TTATGAt CCA AATGGATTTA 2 03 5 20 

CTACTGAAAA ATTAGCCGCA CTTTTCGATA TTAAAAATAC AAAACGTGGG CGTGTGAAAG 2 03 58 0 

ATTATG CAG A ACAGTTTGGT TTGCAATATT TTGAAGGTAA ACGCCCTTGG GAAGTGCAAG 2 03640 

TTGATATTGC GCTTCCTTGT GCAACTCAAA ATGAATTAGA ACTTTCTGAT GCACAACGTT 2 03 700 

TAATTAAAAA TGGTGTGAAA TTAGTGGCTG AAGGTGCGAA TATGCCAACA ACAATTGAAG 2 03 760 

CAACAGAAGC ATTACTAGCT GCAGATGTAT TATTTGGCCC GGGTAAAGCT GCCAACGCTG 2 03 82 0 

GTGGTGTTGC TACTTCTGGT TTAGAAATGG CACAAAGTTC ACAACGTTTA TATTGGACAG 2 03 880 

CGGAAGAAGT GGACG CTCAA TTACATCGCA TTATGTTAGA TATTCACGCA AACTGTAAAA 2 03 940 

AATACGGCAC AATTGAAGGT CAAGAAAACA .TTAACTATGT TGTTGGGGCA AATGTAGCAG 2 04 000 

G CTTTGTT AA GGTGGCTGAT GCAATGTTAG CCCAAGGCGT TTATTAACAA ATAAAAAAGT 2 04 06 0 

TAAACTAAAT AACCGCACTG TAATAAAGTG CGGTTATTTT TTTAGGATGA AATTTGAATA 2 0412 0 

TAAGAAAGCG ACATTTTGTC GCCTTAGATA TTATTTATTT TGAATTGTTT TCGTCACAAT 2 0418 0 

GATTAATATC ACTGCATTTG C CGTAAAG AT ACACGTTATG CGTTTTTAAT TTTATGCCGT 2 04 24 0 

ATTTTTCACT GATTTCACGC TGACGTTGTT CAATAATATT ATCCGTAAAT TCAAATACTT 2 043 00 

TACCGCAATC TTCACAAATA ATATGATCGT GATGTTCTGT TGGAGCAAGC TCAAAAACGG 2 0436 0 
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ATTTATTTCC 


CTCAAAATTA 


TGACGGATTA 


CAATATGCGC 


TTCATCAAAT 


TGATTAAGCA 


204420 


CACGATAAAC 


TGTGGCTAAT 


CC AATTTC AC 


AACCTTGTTC 


CAGGAAAATT 


TTATAAACAT 


204480 


CTTCTG CAG A 


AAAATGTTCA 


TTTTTATGAT 


TTTGCATTAA 


AG CG AGAATA 


GTTAAGCGAG 


204540 


GCTCTGTAAT 


TTTTAATCCC 


ACTTTTTTGA 


GTAATTTAAT 


ATTTCCTTCA 


GACATAATTT 


204600 


TCCTTCCTAT 


TATGCACAAA 


ATGCTAAGCC 


AATTCAGCTA 


AGCACATTTC 


ATCATAAATT 


204660 


TGTTTACACC 


ATTTTTCTAC 


ACGCTCTGCG 


GTAAGCTCTG 


GTTGGCGATC 


TTCATCAATA 


204720 


CATAATCCGA 


TGAAAGTGCC 


ATCTTCCAAT 


AAGGCTTTCG 


AAGCTTCAAA 


ATTATAGCCT 


204780 


TCTGTTGGCC 


AATTTC CTAC 


C AC AATTG CA 


CCGTGTGGCT 


CTATAATATC 


GCGCACAGTT 


204840 


CCGATAGCAT 


CACAGAAATA 


ATCTGCATAA 


TCTTCTTGAT 


CGCCACAACC 


GAAAATACCT 


204900 


ACAAGTTTAT 


CTGTAAAATC 


AATTTCTTCG 


AGTGTTGGGA 


AAAAGTCATC 


CCAGTCTGCT 


204960 


TGTGCTTCGC 


CGTAATACCA 


AGTTGGGATA 


CCGAAAAG CA 


AG AAATC G T A 


TGCTTCAATA 


205020 


TCTTCTTTGC 


TACTTTTGGC 


AATATCACGA 


ATATCAATTA 


AATCACTACC 


TAATTGTTTT 


205080 


TGGATTTGTT 


TTGCGATGTT 


TTCAGTATTT 


CCAGTATCAC 


T AC C AT AAAA 


AAGACCAACT 


205140 


ATTGCCATTT 


TTATTTCCTT 


TTTTATATTA 


AATTTAATCT 


ATTGATTATA 


AGGCTTATCC 


205200 


AAATTTGGCG 


AACTATATCA 


TAAAAGCATT 


CTCACTTAAA 


CTACTTAATT 


CTTATTTAGA 


205260 


AATCGTTTAA 


TTGCACGAAT 


AACAAAATCA 


GGTTTTTCGG 


CGTGAACCCA 


GTGTCCACTG 


205320 


CCATTAATCG 


TAAAGGCAGT 


TGCATTAGGA 


AATTGTT CT A 


GAATTTTTTC 


GCTATTTTCT 


205380 


ATTTTTATAT 


AAGAGGAATT 


TCCACCCTTA 


ATAAAAAGAG 


TTGGCGTAAA 


TACACGTACT 


205440 


TTTTCCCAAT 


CCATAATATT 


GG CAT AATTG 


TTAAAAAGCG 


CGGTAAGATT 


AAAACGAAAA 


205500 


CAATCAG CTG 


AATTAACATC 


AAAAGATTTC 


AACATAAATT 


GTACAACGTC 


TTCATCATTA 


205560 


ATTTCTTGTT 


TTAAAATCGG 


TTTGGCTTGC 


TGACGATTTT 


CAGGCTTTGC 


ATTTTTT AC C 


205620 


GCAAATAGGC 


CATTAAAAAC 


ATCTTTATGA 


CCAAATCCCT 


CATAAGGCAT 


AGGCGACATA 


205680 


TCAATTACGA 


TTAATTTTTC 


GACTAATTCT 


GGGCAAAGTG 


CGGTGATTTT 


CATGGCGGTT 


205740 


TTTCCCCCCA 


TAGAATGACC 


AATTAATATC 


ACTTTGGATA 


AATTTAAGTG 


GCGGATTACT 


205800 


GCGATAACAT 


CTTCAGCCAT 


CAGTTGATAA 


TTCATTTTTT 


CCGAATGAAA 


ACTATGGCCG 


205860 


TGATTGCGTA 


AATCAATACG 


TAAAATACTG 


TAATGTTCAC 


TAAAAGCTCT 


CGCGATAACG 


205920 


CCGAGATTAT 


CCATATCGCC 


GAATAAACCG 


TGAATAAAAA 


TTAAAACAGG 


TGTATTAATC 


205980 


GTTTGTTTTA 


CTTGATGAAA 


TTGATAATTG 


AGTAATGATT 


TTGCCATAAT 


TTTTACGTGA 


206040 
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GAATTTGTAA AGAAGTCGTT ATAATTGAAA AAAATTTTAG CAAACAGAGA GATAAAAATG 2 06100 

AAGATCATTG AAGTAGATGA AGAATTATAC CAATATATTG CGAGCCAAAC CCGCTCTATT 2 0616 0 

GGAGAAAGTG CTTCGGATAT TTTACGCCGT TTGCTTAGTT TACCTGTGCA TACTTCAATT 206220 

GTTAATGATT TAATTATTAC TTCAGCAGAA ACAGATCAAA AGCCAAAACA GGCTATCAAT 2 06280 

GTGAAAGAAG TAAATATCAA AACAACAAAA AAGCAATCAA TAACCGCAAT TAATCAGATT 2 0634 0 

GTAGAAAAAG TCCAAACTTT ATTAAATTCA ACTGAATTTC AAGAAGAAAG TAAGGCTGTT 2 06400 

GTACGTTTCT TAG CGATTTT ACGTGTGCTT TATCGTACAA ATCCTGAAAG TTTTGCGCAA 2 0646 0 

GCAACGGAAT CTTTACAAGG ACGTACTCGA GTTT ACT TTG CACGCGATGA AGCGACATTA 2 06 520 

TTAATGGCAG GTAACCATAC TAAACCAAAA CAAATTCCAG ATACGCCTTA TTGGGTTATT 206 580 

ACCAATACAA ATAGTGGACG AAAAATGTTA ATGCTAGAAG GTGCAATGCA ATCTATGGAA 2 0664 0 

TTAC CTG AAA CATTGATTGA TGAAGTGCGA TCATATTTCA CGGTAAATTA ATAATGTATC 2 06700 

CTTGGCAAGA TTTTGCTATT CAACCTGATT TCTCAGATAA AATCGCTTTG CGTACTACGC 2 06 76 0 

AGGGCGATAT GCTTACTTGG ATAGAATTGA CTACAAAGAT TAACCAAACA GTGGCTTTTT 2 06 82 0 

TACAAAAAAA AGGCGTAAAT GCAGAAAGTG CGGTTGCTTT TGTGGGAAAA AATTCAGAGA 2 068 80 

AAATTTTATT TTTATATCTG GCGACAATTC AGCTTGGCGC AAAAGTTTTA GGCATAAACC 2 06 94 0 

CTGCTTTTCC ACAAGAAAAA ATTGCAAAAT TATGTGAGTT TTATCAAATT GATTTTTGTT 2 0 70 00 

TTTATGATAA AGATTTACTG AATTTGCAAG AAATTGATGT TTTTACTCAA AAAGCCGATT 2 07 060 

TTTTTCGTCC TGCGACGATG ACGCTAACGT CTGGCTCGAC AGGTTTACCA AAAGCAGTTG 2 0712 0 

TGCATAATGT CCAAGCACAT TTGGATAATG CAAAAGGGGT ATGTAACTTA ATGAAGTTTG 2 0 718 0 

ATTGTAATCA ATCTTGGTTA CTTTCATTAC CCTTATATCA CGTTTCAGGG CAAGGTATTG 2 0724 0 

TTTGGCGTTG GTTATATTGC GGTGCACAAT TACATTTCCC AGAAGATGAT TTTTATGCTT 2 0 7300 

CATTATTAAA GACGACCCAC GTTTCTCTTG TGCCAACGCA ATTACAGCGT TTATTAGATT 2 073 6 0 

ATTTACAGGA AAATCCGAGC ATTTCATTTG CTACACGCCA TATTTTACTG GGCGGTGCGC 2 0742 0 

ATATTCCGAC AGAACTTACA CAAAATATGT TGAAATATGG TATCGAAACG TATTCTGGCT 2 0 7480 

ACGGAATGAC GGAAATGGCT TCGACAGTTT TTGCTAAAAA ATCTGACAGA AAACAAGGCG 2 07 54 0 

TAGGGCAACC GCTCTTAGGT AGAGAGTATT GTTTAGTAAA TGATGAAATT TGGCTGAAAG 2 0 76 00 

GTGCAGGTTT GGCGATGGGT TATTGGAAAG ATCGACAAAT TGTTCCATTA ACGAATAACC 2 076 6 0 

AAGGCTGGAT TCAGACAAAA GATAAAGGCA TTTGGCAAGA GGGCGAACTT GTTATTATCG 2 07720 
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GACGACTTGA TAATATGTTT ATTTCAGGTG GCGAAAATAT TCAGCCAGAA GAAATTGAAC 207780 
AAGTGATTAT TCAACATTCT TCAGTTAATC AAGTATTTGT TTTACCACAA AAAAACAAAG 207840 

AATTTGGTCA GCGTCCTGTC GCTTTAGTGG ATTTTAATGA GCCCTTTAGC AAAAGTGCGG 207900 

TTGAAAATTT AATGTTTTTT TTACAAGATA AACTTGCACG TTTTAAACAA CCTATTGCAT 20 7950 

ATTATCCTTT GCCACTGATG CTTGAGAAAG G CATTAAG AT CTCACGTAAA CAGCTTGCTG 208020 

ATTGGTTGGC AAAGCGAGAT GAGATAAATT AATTTACCAT TTAGATTAGG GAAAAATAAT 208080 

GATTAGGAAA CTTATGAAAA TACCTCCATT TTTTACCGCA CTTTTTGCCT CGGCAATGTT 20814 0 

TACCCTTAGC GTTTCGCAAG GGGTATTGGC GG CAAATTC A ACAAATGTAT TACCAACGGA 208200 

GCAATCGCTT AAAGCCGATT TGGCTAACG C TCAAAAAATG AGTGAGGGGG AGGCAAAAAA 20826 0 

TAGGTTGTTA GCAGAATTAC AAACATCAAT TGATTTATTG CAGCAAATTC AAGCCCAACA 2 08 32 0 

AAAAATCAAT GATGCATTGC AAACAACATT AAGCCATTCA GAGAGTGAAA TTAGAAAGAA 2 08 3 80 

TAATGCAGAA ATTCAAGCAT TAAAAAAACA ACAAGAAACG GCAACAAGTA CTGATGATAA 2 0844 0 

TGCGCAATCA CAGGACTATT TACAAAATAG TCTCACTAAG TTAAATGATC AATTACAAGA 2 08 500 

TACGCAAAAC GCATTAAGCA C AG CAAATG C ACAATTGGCA GGGCAAAGTT CTATTTCTGA 2 08 56 0 

ACGAGCACAA GCGGCATTAA CAGAAAATGT TGTACGCACA CAGCAAATCA ATCAACAATT 2 0362 0 

AGCCAATAAT GATATTGGTA GCACATTGCG AAAACAATAT CAGATTGATT TACAACTGAT 2 0868 0 

TGATTTAAAG AATAGTTATA ATCAAAACTT ATTAAAAAAT AATGATCAGC TTTCTTTACT 2 0874 0 

CTATCAAAGC CGATATAACC TATTGAATTT ACGTTTGCAA GTACAGC AG C AAAATATTAT 2 08800 

TGCGATTCAA GAAGTCATTA ATCAAAAAAA TCTTCAGCAA TCACAAAATC AAGTAGAACA 2 08860 

AGCTCAGCAA CAGCAAAAAA CTGTGCAAAA TGATTACATT CAAAAAGAAC TTGATCGCAA 2 08 920 

TGCACAGCTT GGTCAGTATT TATTACAACA AACAGAAAAA GCGAATTCAT TAACTCAAGA 2 08 98 0 

TGAGTTAAGA ATGCGGAATA TTTTAGATAG CCTCACTCAA ACT CAACGT A CTATTGATGA 2 0904 0 

ACAAATTAGT GCATTACAAG GCACGTTAGT TCTTTCACGT ATTATCCAGC AACAAAAACA 2 09100 

AAAATTACCG ACTAACTTAA ATATTCAAGG TTTATCAAAA CAAATTG CCG ATTTGCGTGT 2 0916 0 

ACATATTTTT GACATTACTC AAAAACGCAA TGAACTTTAT GATCTAGATA ACTATATTAA 2 0922 0 

TAAAGTTGAA AGTGAAGATG GAAAACAATT TACTGAGGCA GAAAGAACGC AAGTTAAAAC 2 09280 

GCTATTAACT GAGCGTCGGA AAATGACATC TGATCTGATT AAATCTTTAA ATAATCAATT 2 0934 0 

AAATCTCGCG ATTTCTTTGG AATTAACACA GCTGCAAATT ACACAAATCA GTGATCAAAT 2 09400 
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TCAATCTAAA TTAGAGCAGC AAAGTTTTTG GGTGAAAAGT AATAATCCTA TTAATTTAGA 2 0 9460 

TTGGGTAAAA ATGCTGCCAA GAGCTTTAAT TGAACAGTTT AATGGTATGT TGAAAAAATT 209520 

AGGTTTTCCA ACTAATTATG ACAATTTACC TTATTTGCTG ATGTATTTCT TAGGGTTATT 2 0958 0 

TATTGTAGGC GGTGCAATTT TTAAATTTAA AAATCGTATT AAACAACAAT TAAACAAAAT 2 09640 

TAATCGTGAA ATACATCGTT TAGATACAGA TAGTCAGTGG AGTACTCCAC TTGCCCTGTT 2 09700 

ATTAACAGCG TTTTTAACGC TTTCTAGTAC ACTTTGGTTT TTAGCAGTTT GCCAAATGAT 2 0 9760 

CGGCTTCTTT TTCTTCAAAA ATCCAGAAGA ATTTTGG C AT TGGTCATTTA GTATGG CGGG 20 9820 

TTATTGGTGG TTCTTTACAT TTTGGATTTC ATTGTTCCGT C C AAATGGT A TTTTTGTTAA 2 0 98 80 

TCATTTTGAA TCTTCAAAAG AGAATG C AC A ACGTTTTCGT GGTGTTATCC AGCGCATTAT 2 0 994 0 

TGTTGTGGTT GTATTGCTTT TAAATACTTC TGTGTTTAGC AATGTCACTG ATGCAGGTTT 2100 00 

AGCTAATGAT GTGTTAGGTC AAATTAATAC TATTGCGGCT CTAATTTTCT GTGCGGCGAT 210060 

TATTGCCCCT CGTTTTAATC GAGTACTTCG CTCTTATGAA CCTGAAACAA ATAAACATCA 210120 

TTGGTTAATA AGAATTGTAC AAATTGGTTT TAGATTAATT CCTGTGGGAT TAATCGTAtT 21018 0 

TATTGTTTTA GGCTATTACT ACACCGCTTT AAATTTAATT GAGCATTTTA TTCATTCTTA 210240 

TATTGCTTGG TGTGTATGGT GGTTAGTACG CAATACGATT TAC CGTGGT A TTACGGTTTC 210300 

TTCTCGTCGT TTAGCACATC GCCGTTTAGC AGAAAAACGT CGTCAAAAAG CACTTGAAAA 2103 6 0 

TAATTATGAA AATATATCCT CTGATGATGT GGTTGCAGTG GGAGAGCCAG AGGAAAGTTT 210420 

GGCGTTAAAT GATGTGCGTA GCCAATTATT ACGTTTTGTA GATCTCTTTA TTTGGACAGC 210480 

ATTATTAGGG ATCTTCTACT ATGTATGGTC AGATTTAGTC ACCGTAGTGA GCTATTTACG 210 54 0 

TGAAATTACA CTTTGGCAAC AAAC CACGAC AACCGATGCT GGCACTGTAA TGGAAAGCAT 210600 

TACTTTATTT AATCTTCTTG TTGCTTTGGT TATCGTAGGA ATTACTTATG TGTTGGTTCG 21066 0 

TAATATTTCA GGCATTTTGG AAGTATTAAT TTTTTCTCGC GTGAATCTTT CACAAGGTAC 2107 20 

GCCTTATACG ATTACGACAC TACTTACTTA TATTTTTATC GCCATTGGTG GTGCGTGGGC 21078 0 

ATTTGCAACC TTAGGGATGT CTTGGTCAAA ATTACAATGG TTATTCGCCG CACTTTCTGT 210 84 0 

TGGTCTTGGT TTTGGTATGC AAGAGATTTT TGCAAACTTT GTGTCAGGTA TCATTTTGTT 210 900 

ATTTGAACGC C CAATCCG AG TTGGCGATGT GGTTACCATT AATGAAGTGA GCGGTACTGT 210960 

AG CG AAAATT CGTATCCGTG CAATTACATT GATTGATTTT GATCGCAAAG AAGTGATTGT 2110 20 

GCCAAATAAA TCTTTTGTGA CTGGTCAAGT GACCAACTGG GCATTATCTA ACACTATGAC 211080 
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ACGTTTAGTG ATTAGCGTTG GTGTTGCTTA TGGTTCTGAT TTAACACTCG TGCGCCAATT 21114 0 

ATTACTTCAA GCAGCTGATG AACAACCGAC TATTTTACGC GATCCTAAAC CATCTGCTTA 211200 

TTTTTTAACT TTTGGTGCAA GTACACTGGA TCACGAGTTA CGTGTTTATG TAGAACAAGT 21126 0 

TGGGGATCGC AC CAGT ACT A CAGATGCCCT TAATCGCCGT ATTAATGAAT TATTTG CAGA 2113 2 0 

ACATAATATT GATATTGCCT TTAATCAATT AGATGTATTT ATCAAAAATA ATGATACAGG 2113 8 0 

CGAAGAAATT CCTTTTGTTG ATGTAAAAAA AT AAGGG CAA ATTATGGCTG GTAATACAAT 21144 0 

TGGACAACTT TTCCGTGTGA CAACCTTTGG AGAGTCACAT GGTATTGCAT TAGGCTGTAT 211500 

CGTTGATGGC GTGCCACCAA ATCTCGAATT ATCCGAGAAA GATATTCAGC CAGATTTAGA 21156 0 

TCGTCGTAAA C C AGGAAC AT CTCGATATAC GACGCCTCGT CGTGAAGATG ACGAAGTTCA 21162 0 

AATTTTATCT GGTGTGTTTG AAGGAAAAAC CACAGGCACA AGTATTGGGA TGATCATTAA 21168 0 

AAATGGAGAT CAGCGTTCGC AAGATTATGG TGACATTAAA GATCGTTTCC GCCCAGGTCA 21174 0 

TGCGGATTTT ACCTATCAGC AAAAGTATGG AATCCGTGAT TATCGTGGCG GTGGGCGTTC 211800 

GTCAGCACGT GAAACAGCGA TG CGGGTTGC TGCAGGGGCT ATTGCGAAAA AATATTTACG 211860 

CGAACATTTT GGCATTGAGG TGCGAGGTTT TTTAAGCCAA ATCGGTAATA TAAAAATTGC 21192 0 

TCCGGAGAAA GTGGGACAAA TTGATTGGGA AAAGGTAAAC AGTAATCCAT TCTTTTGTCC 211980 

TGATGAAAGT GCGGTAGAAA AATTCGATGA ATTGATCCGT GAACTTAAAA AAGAAGGAGA 21204 0 

TTCTATTGGC GCAAAACTTA CTGTTATTGC AGAAAATGTA CCTGTAGGAT TGGGCGAGCC 21210 0 

AGTATTTGAC CGTTTAGATG CCGATCTTGC TCACG C ATT A ATGGGAATTA ATGCAGTAAA 21216 0 

AGGTGTAGAA ATTGGCGATG GCTTTGCTGT GGTTGAACAA CGAGGTTCGG AACATCGTGA 212220 

TGAAATGACA CCTAATGGCT TTGAAAGTAA TCATGCAGGC GGTATTTTAG GCGGAATTAG 2122 80 

TTCAGGACAA CCAATTATCG CCACTATTGC ACTAAAACCA ACTTCAAGCA TTACGATTCC 212 34 0 

TGGTCGTTCA ATCAATCTTA ATGGTGAAGC CGTAGAAGTT GTAACAAAAG GTCGTCACGA 2124 0 0 

TCCTTGTGTG GGGATTCGTG CTGTGCCAAT TGCGGAAGCT ATGGTGGCGA TTGTCTTATT 21246 0 

AGATCATCTC TTACGTTTTA AGGCACAGTG TAAATAATCC TGTAC CTAGG GTAGTATACT 212 52 0 

CTAGGTTAGT TCTTCAATAG GACTGAACCT AGGATATACT TACCTACAGC AAAACTTTAG 212 58 0 

AATGAATAAA ATTTTATTAA AAAC AAC CAT AATTTTTACC GCACTTTTCT CTCTGAACGT 2126 4 0 

TGTTGCGTCT CCTCTAGATT GGCAAAAAGT GAAACGCCCT ATTCCAAGTG AAGATGGTAA 212 70 0 

AGCAAGTCCT ATTGGTAGTT ATACCAACGG GTGTATTATT GGAGCACAAG CATTACCGCC 212 76 0 
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GAAAGGTGAA GGTTACCAAG TGATTCGCAT GAATCGCAAT CGTTATTATG GTCATCCAAA '2 12820 

TATGATTCAG TATCTTGAAC GTTTGGGGCA GCGAGCTAAA GCTGCTGGGT TGCCAACGAT 2128 80 

GTTAGTAGGC GATATTGCCA TGCCCGGTGG TGGGCGGTTT TTAACGGGTC ATGCTAGCCA 212 94 0 

TCAGATGGGA TTAGATGCGG ATATTTGGTT AAGAATGGGC GAAATGTCAG ATGCCGATGC 213000 

GCTCAATTCT GATGGTAAAG GTTTATTAGT TGTGGATAGA AAGGCGCAAC GAGTAGATGA 213060 

GCGCGTGTGG AATAGCAATC ATGCGACGTT AATCAAGTTA GCTGCGCAAG ATCCAAATGT 21312 0 

GACGCGTATT TTTGTTAATC CTGCGATTAA GGTAAAATTA TGTCAAACCG CTGGCAATGA 213180 

TCGCGGTTGG CTACACAAAA TTCGTCCTTG GCATGGACAC AACTCTCATT TTCATGTTCG 213240 

TTTAACTTGC CCAGCAGATG CATCTTATTG TGAAAATCAA GCGCCTGTGC CTGCTGGTGA 213 3 00 

TGGTTGTGGA GACGAATTAT ATTCTTGGTT TGAACCGCCA AAACCAGGCA CTTCAGTGAG 213 360 

CAAACCTAAA GTTACACCGC CTGAGCCGTT TTTGTGCCAA CAGATTTTGA ACTCACCGAA 2134 2 0 

TCGGAGAGAA TGGTTAGAAT AGCATTGAGG TAAATCAATA TGGATATCGG CATTGATCTT 2134 80 

TTAGCAATAT TGTTTTGTGT TGGTTTTGTC GCATCATTTA TCGATGCAAT TGCTGGCGGT 213 54 0 

GGTGGATTAA TCACCATTCC AGCGTTACTC ATGACAGGTA TGCCACCAGC AATGGCGTTA 21360 0 

GGCACCAACA AATTGCAAGC TATGGGCGGT GCATTATCCG CAAGCCTTTA TTTCTTGCGA 213660 

AAAAGAGCGG TCAATTTACG CGATATTTGG TTTATTTTGA TTTGGGTTTT CTTAGGTTCT 21372 0 

GCCCTAGGTA CATTATTAAT TCAATCAATT GACGTGGCGA TTTTCAAAAA AATGCTTCCT 213780 

TTTTTGATTT TAGCCATTGG TCTATATTTT TAATTTACTC CTAAATTAGG TGATGAAGAT 213 84 0 

CGAAAACAAC GATTAAGTTA TCTGTTATTT GGTCTTTTAG TTAGCCCATT TTTAGGTTTT 213 900 

TATGATGGCT TCTTTGGGCC AGGGACTGGC TCAATCATGA GTTTAGCCTG TGTTACTTTG 213 960 

CTAGGATTTA ATCTCCCGAA AGCGGCAGCA CATGCAAAAG TGATGAACTT CACTTCGAAC 214 02 0 

CTTGCTTCTT TTGCACTTTT CTTATTGGGC GGACAAATTC TTTGGAAAGT GGGTTTCGTG 214 08 0 

ATGATGG CTG GG AG C ATTTT AGGTGCAAAT TTAGGTG CCA AAATGGTGAT GACGAAAGGT 21414 0 

AAAACCTTGA TTCGACCGAT GGTTGTTATC ATGTCTTTTA TGATGACGGC TAAAATGGTT 2142 00 

TACGATCAGG GTTGGTTTCA TTTTTAATTC GGAAAGCGCG CAAAAGTGCG GTTAAAATTA 21426 0 

ATTACATTTT ATTATGTCGG ATAATCAACA AAATTTACGT TTGACGGCGA GAGTGGGCTA 21432 0 

TGAAGCGCAC TTTTCATGGT CGTATTTAAA GCCTCAATAT TGGGGGATTT GG CTTGGT AT 2143 8 0 

TTTCTTTTTA TTGTTGTTAG CATTTGTGCC TTTTCGTCTG CGCGATAAAT TGACGGGAAA 21444 0 
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ATTAGGTATT TGGATTGGGC ATAAAGCAAA GAAACAGCGT ACGCGTGCAC AAACTAACTT 214 50 0 

GCAATATTGT TTCCCTCATT GGACTGAACA ACAACGTGAG CAAGTGATTG ATAAAATGTT 214560 

TGCGGTTGTC GCTCAGGTTA TGTTTGGTAT TGGTGAGATT GCCATCCGTT CAAAGAAACA 2146 2 0 

TTTGCAAAAA CGCAGCGAAT TTATCGGTCT TGAACATATC GAACAGGCAA AAGCTGAAGG 214680 

AAAGAATATT ATTCTTATGG TGCCACATGG CTGGGCGATT GATGCGTCTG GCATTATTTT 214740 

GCACACTCAA GGCATGCCAA TGACTTCTAT GTATAATCCA CACCGTAATC CATTGGTGGA 214 8 00 

TTGGCTTTGG ACGATTACAC GCCAACGTTT CGGCGGAAAA ATG CATGCAC GCCAAAATGG 214 86 0 

TATTAAACCT TTTTTAAGTC ATGTTCGTAA AGGCGAAATG GGTTATTACT TACCCGATGA 214 92 0 

AG ATT TT G GG GCGGAACAAA GCGTATTTGT TGATTTCTTT GGGACTTATA AAGCGACATT 214 98 0 

AC CAG GGTTA AATAAAATGG CAAAACTTTC TAAAGCCGTT GTTATTCCAA TGTTTCCTCG 21504 0 

TTATAACGCT G AAACGGG C A AATATGAAAT GGAAATTCAT CCTGCAATGA ATTTAAGTGA 215100 

TGATCCTGAA CAATCAGCCC GAGCAATGAA CGAAGAAATA GAATCTTTTG TTACGCCAGC 21516 0 

G C CAG AG C AA T ATGTTTG G A TTTTGCAATT ATTGCGTACA AGGAAAGATG GCGAAGATCT 215220 

TTATGATTAA AAATAGGGTT TGAAAGTCAT ATTCTCGAAA ACGGATATTC AATAAGGTAA 215280 

ATAAGAAATG GAAGAAAAAA TTCGGCTGAC GCAATACAGC CATGGTGCAG GTTGAGGCTG 215 340 

TAAAATTTCG CCTAAGGTGT TAGGGACAAT TTTACATTCA GAACTGGAAA AATTTTATGA 215400 

wCCAAATCTG ATCGTTGGCA ATGAAACTGC AGACGATGCG GCGGTGTATG ATCTTGGTAA 215460 

TGGAACTGCA ATTATCAGTA CCACAGACTT TTTCATGCCG ATTGTGGATG ATCCTTTTGA 21552 0 

TTTTGGACGA ATTGCTGCGA CCAATGCCAT CAGTGATATT TTTGCTATGG GCGGTAAGCC 21558 0 

GATAATGGGA ATTGCTATTT TAGGTTTTCC AACCAATGTA TTACCTGCAG AAGTTGCACA 21564 0 

GAAAATTGTT GATGGTGGGC GTTTTGCTTG CCATCAAGCG GGGATTGCTT TGGCGGGTGG 215700 

ACATTCCATT GATTCGCCTG AACCTATTTT TGGTTTAGCC GTGACAGGCG TGATTGACAC 21576 0 

TGAAAAAGTG AAACGTAATG CTTCTGCAAA ATCAGGTTGT AAGCTTTATA TGACGAAACC 215820 

GCTTGGTATT GGTATTTTGA CGACGGCGGA AAAAAAAGGA AAACTAAAAC CAGAACATCA 215 8 80 

AGGTTTAGCC ACTGCCGCTA TGTGCCAAAT GAATTCTATT GGTAGTCAAT TTTCCCAAGT 215 94 0 

GGATGGCGTC ACCGCAATGA CGGATGTCAC TGGATTTGGT TTACTTGGTC ATTTAATTGA 216 00 0 

AATCTGTGAA GGTTCAAATC TAAGTGCGGT AGTTTTTTCG GACAAAATTA AAACCTTGGA 216 06 0 

CGGTGTAAAA GATTATATTG CTCAAGGTTG CGTAC CGGGT GG AACAGGGC GTAATTTTGA 21612 0 
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CAGTTATGGA CATAAAGTTG GGATATTGAC GGAAGAACAA AAAG CG ATTT TGTGTGAXCC 216180 
TCAAACATCA GGCGGCTTAT TAGTGGCAGT TGAGCTTAAT AGCGTTCAAA CAGTAATTGA 216240 
CATAGCCAAA GATGCGGGGA TTGATTTATA TGAAGTGGGT AAATTGAAAC CTAAATCAGA 216300 
ATCCGATATC GTTGTTGAAG TAAAATAAGG AATTTTAATC GATCCTCCCT AATGATTAAT 21S 3 60 
TAAATTACCA TGATCCACAT AGCAAAAAAC CTTGGAATAT CCCAAGGTTT TTTAGATTAT 216420 
CATTAAATAT AATGAATTAT GCGCCTAAAC GCTCTTTAAT ACGAGCTGAT TTACCTGAAC 216480 
GTTCACGTAA GTAGTAAAGT TTAGCTTTAC GTACCGCACC TTTACGTTTA ACTGCGATAG 21654 0 
AATCTAC AG C TGGAGAGTGA GTTTGGAATA CACGCTCAAC GCCTACGCCG TTAGATACTT 216600 
TACGTAAAGT AAAXGCTGAG TGCAAGCCAC GGTTACGAAT TGCAATAACC ACGC CTTCGA 216660 
ATGCTTGCAA ACGACGTTTG CTACC^CAA CCACCCATAC TTTAACTTCT AAAGTATCAC 216720 
CTGGGCGGAA G CTAGGTACG TTTTGTTTTA ATTGTTCTTG TTCAAGTTGT TTGATAATGT 216780 
TAGACATTGT TTGATCCTTT ATTGATCCTA GATGTAACTG AAACTAACTG TTATGCTCGG 216840 
CTTGCGCCTC TTTTAACAGC TTACGTTGTT CGTCAGTCAG AGCTAGGCCT TCTAAGAGCT 216900 
CAGGGCGTCG GAGCCACGTT CTCTGTAGCG ATTGTTTTAA TCGCCATTTC CGAATTTCTT 216960 
CGTGATGTCC CGACATCAGC ACGGGTGGTA CAGTTAATCC TTCTAACACT TCTGGTCTAG 217020 
TGTAATGCGG GCAATCTAAT AAACCATCTG CAAAAGAATC TTCTTCTGCC GAGGCTTGTT 217080 
TGCCTAATAC ACCGGGAATA AAACGTGCAA CAGCATCAAT TAATGTCATT GCCGGCAATT 217140 
CCCCACCGGT CAGAACGTAA TCGCCGATTG ACCATTCTTC ATCAATTTCA G TTTG AATCA 217200 
ACCGTTCATC AATCCCTTCG TAACGTCCAC ATACTAAAAT CAATTTCTGA TTTTGAGCAA 217260 
GCTCGGTTAC GCCGCCTTGA TCGAGTTTAC GTCCTTGTGG CGAAAGGTAA ATCACCTTTG 217320 
CGCCTTCTCC TGCCGCAGCT TTTGCAGTAT GAATCGCATC CCGTAAAGGT TGCACCATCA 217380 
TTAGCATTCC TGGGCCACCA CCATAAGGGC GATCGTCCAC GGTTTTATGC TTGTCGAATG 217440 
TAAAATCTCT TGGATTCCAA CATTCAACTT TCAGAAGATT ATGTTTTACG GCTCTACCTG 21750 0 
TAACCCCAAA TTCCGTAATT GCTTTAAACA TTTCGGGGAA TAATGAAATT ACCCCTATCC 2175 60 
ACATAAAAAG CCTACTACAT TACGTTTGTG CATACTTGAG ATTAGAAACC AGCGTCCCAA 21762 0 
TCCACTTCAA TAGTTTTCGT GGTGAGATCG ACTCTTTTAA CTACTTGTTC ATACAAAAAC 217680 
GGAATTAACC GCTCTTGTTT TCCAAAAGCA TCTTTGGTAT TGGCTTTAAC CACTAATACA 217740 
TCATTAGAAC CCGTTTCCAT CATTTCTGTT ACTGTTCCCA TTGTATAACC TTCTAAGTTT 217e0 0 
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ACGACTGTAC 


' AACCGATTAA 


. ATCGTGCCAG 


TAATAATCGC 


CCTCTTCTAG 


TTCTGGGAAA 


217860 


ACAGATAAAT 


CTACACCAAT 


TTCAACATTC 


GCTAAAATTT 


GTGCAGCTTC 


ACGGTCATCA 


217920 


ACGCCTTTTA 


ATTTAACGAT 


GATTTCGTGA 


TTATGATAAC 


GCCAGTTTTC 


TAATTCAATT 


217980 


GATTGCCATT 


CGCCTTTGAT 


TTTTAAAAAC 


CAAGGTTGAT 


AATCAAAAAT 


GCTTTCAGCT 


218040 


TGTTCTGTTG 


ATGAATAAAT 


ACGCAACCAC 


CCACGAATAC 


CGTAGGTTGA 


GCCTAATTTG 


218100 


CCCACAACTT 


CAATATGTTG 


TTGTTCCATA 


TTTTTCACCT 


ATCTTAGTTT 


GAGAACTAAA 


218160 


TCAAAAAGCA 


AAATTATGCT 


GCTTTTTGTG 


CTTCTTTTAC 


CAAGCTCGCA 


ACACGATCAG 


218220 


AAAGTGAAGC 


ACCTTG AG C A 


AC CCAGTGAT 


TTACACGCTC 


AAGATTAATG 


CGAAGACGTT 


218280 


CTGCATTACC 


TTGTGCAATT 


GGATTGAAGA 


AACCTACACG 


TTCAATGAAG 


CGACCGTCAC 


218340 


GAGGTGAACG 


ACTGTCAGCG 


ACTACGATTT 


GATAAAATGG 


GCGTTTTTTA 


GCTCCGCCAC 


218400 


GAGATAAACG 


AATGGTTACC 


ATAACCTTCC 


TCTAAATGTT 


TAATTACTAA 


AAGAATATTC 


218460 


TCAAATTCGA 


AAGTGATTTG 


AG AAT AT AG C 


TTACTTTTTT 


CTCTGTATAT 


ATAGACAAAA 


218520 


AGCCTGCGAA 


TTTTATACTT 


TTC AAG C AAA 


ATTG CAAGTT 


TTGTATGCAT 


AAAGTAGGTT 


218580 


GTATTCCTTT 


TAATTATCAT 


T C AC AAT TT A 


TTTTGTTTTA 


CGATATAATA 


CGAGGCATCC 


218640 


AGCTGTTTTA 


GGTCAAATTT 


ACGCGTATAG 


CGGTATTTTA 


G CTTG AT AAA 


ATATCAGTTA 


218700 


AATGAGGTTT 


ATATATGAAA 


ATTAGTTTTC 


ATTCTGTGTT 


AATTGGATTA 


GCATCATTTA 


218760 


TTGGTGTTCA 


ACAAGGTGTA 


AT AG CTAATC 


CGTCTTCTCA 


TCAATCAATA 


TCCACTGAAT 


218820 


CAGCTGAGGC 


TTTGAAGCAA 


CAATTTTCAA 


TGGCTTTAGC 


TAAACAAGAT 


AAACAACAAA 


218880 


TTACAAATTT 


ACAAAAAAAA 


CTTACCGCAC 


TTTTTTCTTT 


ACCTCCCCAA 


TTTCTTGATA 


218940 


ATCAGATCCA 


AATAAGCGAA 


AAAAnCCTCA 


CTCG CATTTT 


TAAAACAGAT 


AAAAAT CTCA 


219000 


CGCCTAAATT 


TCTTGATTAC 


TTATACTTTG 


AG CCAATAAA 


TACTGTTGAT 


G CTAAT TTAA 


219060 


TTCAGGAAAT 


GAAGAAAAAC 


TTACTGGTGT 


CTTTTTT AG C 


AAATGATCAG 


GCAAAAATTT 


219120 


ATATTCGTCA 


AACAGATAAT 


TCAGAGCAAT 


TTGTTCAGAC 


TTTAATGGAG 


CGGGG AG CAA 


219180 


AAGCAGATCA 


AATTATATTA 


CTGTCTTTGA 


ATGCCAAAGG 


TATATTTCAA 


AAAATTATTG 


219240 


AACAAATACG 


TCAAGATTTT 


CCGAATCAGA 


CTATTTTTT C 


GATTACTGAA 


AATCGTATTA 


219300 


GTTTGATTAC 


TCCTTCTTCA 


GAAATTAAGT 


CTCGCCTTGC 


TTT AG CAAAT 


ATGATGTTCA 


219360 


ATCGCCAATT 


TAAAGGTGTG 


GAAGTTGATG 


ATTTCTCATA 


TTTAGATCAA 


CCTCGAGAAA 


219420 


ATTTACAGCA 


CAATAATGAT 


GCAATTCGTT 


ACAAAACCTT 


TCAAGCAATG 


CTAGAAGGTT 


219480 
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TAAATTAATC CCTTACCAAT AGGAGTTTTT ATGCTTTTAT CCAAAAAATC AGCCTCCTTT 21954 o 
GCACTCAGTG CATTTGCGAT GCTTTTCACT AGTGTAGCTC TTGCCAAAGA GGCACCACAA 219600 
GCTCACAAAG CTGTGGAATT AAG TATTTTG CATATCAATG ATCACCATTC TTATTTAGAA 219660 
CCGCACGAAA CACGGATTAA TTTAAATGGT CAGCAAACCA AAGTGGATAT TGGTGGTTTT 219720 
TCTGCTGTCA ATGCAAAACT TAACAAATTG CGTAAAAAAT ACAAAAATCC ATTAGTACTG 219780 
CATGCAGGCG ATGCCATTAC TGGTACACTT TACTTCACGC TGTTTGGTGG TTCTGCAGAT 219840 
GCAGCTGTGA TGAATGCAGG TAATTTCCAT TATTTTACTT TAGGTAATCA TGAATTTGAC 219900 
GCGGGTAATG AAGGGTTATT AAAACTGCTT GAACCATTAA AAATCCCAGT GCTTTCAGCT 219960 
AATGTGATTC CTGATAAAAA TTCAATTTTG TATAACAAAT GGAAACCTTA CGATATTTTC 220020 
ACTGTGGATG GAGAAAAAAT TGCCATTATC GGTTTAGATA CTGTGAATAA AACAGTGAAT 220080 
TCCTCTTCTC CGGGTAAGGA TGTGAAGTTC TACGATGAAA TTGCTACTGC ACAAATTATG 220140 
G CAAATGCGC TAAAACAGCA AGGAATTAAT AAAATTATC C TACTTTCACA CGCAGGTAGT 220200 
GAAAAAAATA TCGAAATTGC TCAAAAAGTA AATGATATTG ATGTGATCGT TACTGGCGAT 220260 
TCACATTATT TATACGGAAA TGATGAATTA CGTAGTTTAA AACTTCCAGT AATCTATGAA 220320 
TATCCACTTG AATTTAAAAA TCCAAATGGA GACCCTGTAT TTGTAATGGA AGG CTGGGCT 220380 
TATTCTGCCG TGGTGGGGGA TTTGGGTGTT AAATTCAGTC CTGAAGGTAT AGCGTCTATT 22 0440 
ACTCGTAAAA TTCCTCATGT GTTAATGAGT TCTCATAAAC TTCAAGTGAA AAATG CGGAA 220500 
GGTAAATGGA CGGAATTAAC TGGCGATGAA CGTAAAAAAG CACTTGATAC TTTAAAATCA 220560 
ATGAAGAGTA TTTCACTTGA TGATCATGAT GCAAAAACAG ACATGCTTAT TTCAAAATAT 220620 
AAAAGTGAAA AAGATCGTTT AGCACAAGAA ATTGTTGGTG TCATCACAGG TTCTGCAATG 220680 
CCGGGTGGTT C AG CAAAC CG TATCCCAAAT AAAGCAGGCT CAAATCCAGA AGGTTCTATT 22074 0 
GCAACGCGTT TTATTGCAGA AACAATGTAT AACGAACTCA AAACAGTGGA TTTAACTATT 22 0800 
CAAAATG CTG GCGGTGTACG CGCAGATATT TTACCGGGTA ATGTAACCTT TAACGATGCT 220860 
TATACTTTCT TACCTTTCGG GAATACGTTA TATACCTATA AAATGGAAGG TTCATTAGTG 220920 
AAACAAGTGC TTGAAGATGC AATGCAATTT GCTTTGGTTG ATGGCTCTAC AGGCGCATTC 220980 
CCTTATGGTG CAGGTATTCG TTATGAAGCG AATGAAACAC CAAATGCGGA AGGTAAG CGT 221040 
TTAGTGAGTG TTGAAGT CTT GAATAAACAA ACCCAACAAT GGGAACCGAT TGATGATAAC 221100 
AAACGTTATC TTGTCGGAAC AAATG CTT AT GTTGCAGGCG GTAAAGACGG TTATAAAACC 22116 0 
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TTTGGTAAAT TATTTAACGA TCCAAAATAT GAAGGCGTTG ATACCTACTT ACCTGATGCA 221220 
GAAAGTTTCA TAAAATTTAT GAAAAAACAT CCGCACTTTG AGGCTTACAC TTCATCAAAT 221280 
GTGAAATTTA ATGCTTCAAC TGATGCATTA CCTAAAAAAT AAAATTGAAA AATAGGCTAA 221340 
AAATAGCGCG TATTTAACGC GCTATTTTTG TTATTAGTCC TAGGTTTGAA GACTTCATAG 2214 00 
ATGATTAAGA ATTTTAACTA TTCCATATTT AGATGCTTTC TCGTGGAAAA TAAATTCCCT 221460 
ATAGAGGTTG AAAAATAGGG TGATTTATTG CGATAATCTA CCCCATTTTT TGGGAGATCT 221520 
TTCTATCTTC TGCATTTTTA CGCAGTTTAT TTTCATTAGA AATGATCGAA TTTTATTTCA 221580 
CTTTTAATTT AATTAGATAA AGAACAATGG CGGAAAAACG TAATATTTTT TTAGTAGGAC 221640 
CAATGGGTGC AGGTAAAAGT ACTATCGGTC GTCAATTAGC ACAACAGCTT AATATGGATT 221700 
TTATTGATTC CGATGCAGTG ATTGAAGAAC GTACGGGCGC GGATATTAGC TGGATTTTTG 221760 
ATTTGGAAGG CGAAGACGGC TTCCGAAAAC GTGAAGAACG CATTATTAAC GAACTCACAC 221820 
AAATGCAAGG TATTGTACTT TCAACTGGTG GGGGAGCAGT GCTTTCTAAG GAAAACCGTA 221880 
ATTATTTATC AGCACGTGGT ATTGTGATTT ATTTAGAAAC AACGGTGGAA AAACAATTCC 221940 
AACGCACTCA ACG TG AT AAA AAACGTCCAT TGCTCCAAGA TGCAGAGAAT CCTCGTCAAG 222000 
TATTGGAAGA TTTAGCAAAA ATTCGCAATC CG CTTTATG A AGAAATTGCG GACATCACTT 222060 
TGCCAACCGA TGAACAAAAT GCCAAAATAA TGGTAAATCA AATTGTTGAT TTAATTGATA 222120 
ATATGAATGG GTTGAATGGT G CACTTT AAA C AC AC CTAAG GATAAATTAT GCTGTGCGTC 2 2218 0 
AACGTGGAAT TGCAAGAGCG TCGCTATCCT ATTTTAATTG GTAGCGGATT GTTACAAGAT 222240 
GAACGTAGTT ACCCGATTAA ACGTGGAGAT CGCGTGATGA TTGTAACGAA TCCCACGGTT 222300 
GCACAGTTTT ATCTTGATAC AGTCATCTAT GCGTTGGAAA AACGTGGTTG CGTAGTCGAT 222360 
CATGTTTTAT TGCCAGATGG CGAAAAATAC AAAACACTTG AATCTTTAAA CTTAATTTTC 222420 
ACCGCACTTT TGCAAGGTAA TCATGGGCGA GATACCACGA TTATTGCATT GGGCGGTGGC 222480 
GTAATTGGCG ATGTTGCTGG CTTTGCTGCG GCAAGTTATC AGCGCGGCGT GCGTTTAATT 2 22 54 0 
CAAATTCCAA CAACATTGCT TTCACAAGTG GATTCTTCAG TGGGCGGTAA AACGGCGGTT 222600 
AATCATGAAT TAGGCAAAAA TATGATTGGC GCGTTCTATC AGCCATCAAT GGTTATTATT 22266 0 
GATACG CTCA CGCTTAATAC ACTTCCTAAA CGTGAAGTAA ACGCGGGGCT TGCCGAAGTT 222720 
ATCAAATATG GTGCCATTTT AGATTACGAA TTTTTTGAAT GGCTAGAGCA ACATATTGAT 222780 
G AACTTG TCG CTTTGCATCC TGAGGCGCTT CAACATTGTA TTTCTCGTTG TTGCCAAATT 2 2284 0 
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AAAGCAGATG TTGTCGCGAG AGATGAAACA GAAAAAGGCG ATCGTGCATT GCTTAATCTT 222 900 

GGACATACTT TTGGACATGC CATTGAAACT CACTTAGGTT ATGGAAACTG GCTTCATGGT 22 296 0 

GAAGCTGTTT CTACGGGAAT GATGATGGCA GCGGCGCTTT CGGAAGAACT TGGAGATATT 22 3 02 0 

TCTATTG CAG ATGTGTCTCG CCTTGAAAAA TTACTTGCTC GTGCCAATTT GCCCACCGTA 22308 0 

TCTCCAGATA CGATGCAGCC GGAGGACTAT TTACCGCATA TGATG CGTGA TAAAAAAGTT 22 314 0 

CTATCGGGTA AATTGCGTCT GGTATTACTC AAATCTCTAG GTCAAGCTTA TGTGGCGAAT 223200 

GATACCGAAC ATACCCTCGT GTTAAACG C A ATTCGTCGTT GTACTCAAAC GGATTAAGAT 223260 

GTTACGTCCG AAAAAACAAT CTTTAAAACC TAAGTTAAAA CACCGTCCAT TTTTAAAATG 22 3320 

GGCGGGGGGA AAATTTCGTT TAACGGACGA GATAAATAAA GCCTTTCCAA ACAAAAAAAA 22 3 38 0 

CTGCTTAATT GAG C C ATTTG TGGGCGCTGG AGCAGTATTT CTTAATTCTA ATTTTGAACG 2 23440 

CTATATTTTG G CAG ATATCA ATCCTGATTT AATCAATCTA TTCAATATTG TGAAAGnAAA 22 3 5 00 

TGTAGATGGT TATATTGAAG ATTGTAAGCC GATTTTCTTC GCTGATGATG CCAATACACC 223 56 0 

CGATTATTAC TATGCTAAAC GTCGCCAATT CAATGCATCA ACAGAACCTT TCGAACGTTC 2 2 3620 

GATTATTTTT TTGTATTTGA ATCGTTTTGG TTTTAATGGA TTATGCCGTT ATAACAGCAA 2 2 3680 

GAATGAATTT AATGTACCTT TTGGTG CTT A CAAAACTCAT TATTTTCCAG AGGATGAATT 22374 0 

ACGTTATTTT GCGCATAAAG CGCAAAGTGC GGTGTTTTTA TGCTGTGATT TTCAAAAAAC 22 38 00 

TTTTGAATTT GCCGATAAGG ATTCGGTGAT TTATTGTGAT CCACCTTATG CTCCGCTACA 22 3 86 0 

ACAAGAGACT AATTTTACTG GTTATGCGGG CAATGAGTTT GGACTGGCGC AGCAACGTGC 223 92 0 

TTTGGCAGAT TTAGCGAAAT CTATACAAAA AGAAAAACAA ATTTCGATAT TGATCTCGAA 22 3 98 0 

TCACGATACT AAATTTACGC GTGAAATTTA TAACGGTGCT AAATTTAAAC GGGTAAAAGT 2 24 04 0 

CCAACGTTCT ATTAG CCAAA ACCCAGAAAA ACGCGTAAAA GTGAAAGAAT TGATTGCGAT 224100 

ATTTGGTGCT CGTAAATAAT AAAAATAACC CTTACTTTAT ACTAAAAGTA CCAAAATCGT 22416 0 

AATTTTAAAA C CACAAT AAA ATAATATATT ATGTATTACC ATTTATCTCT TACAGTACCT 22422 0 

ATATACTTTT TTGATACTAA CTCAAAATAC TCACAAGTCT GTAATTTTAC CTGTAATCCT 2242 8 0 

TTTGTTT TAT CAGAATGATT ATCTAGAAAA CTCATCCAAT CATCAGAATA ATTAACAAAT 224 34 0 

AATTATTATC CTATAGTAGA ATGTAAATTA TTTTAATAAC G C CTT ATTTG TACTGTGGAT 2244 00 

AATTCATCAA AATTTAAGCC TGTTATGCTT TATTCTTTAG TGTATAACAA ATAGGATAAA 2 24460 

AATGGAAAAA G CAAAAC CAT CCAAGCAAAG AAATTTTAAT CACTGTCAAG GAGTACATTA 2 24 52 0 
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TTGAATTTGG TAATAAACCG TTAGGTTTAA AAGATACTAC AC C CAAATG C CACTGCCCTA 2 24 58 0 

TTTGTGCTCA AGTTTTATTT GAGAAAAATG TTAATACAGT GCCATCCTCT AATAGCAATT 2 2 4640 

TTTCTAATTA CCCAAAAAGC GAATTCTGCC CCATAAAATT AAACTAAAGC AATTTATACC 224 7 00 

AATCATATTT TCCATTATCA CCAAATAAAA CG CATGG ACT GAATTAAAAA ATTTATTCAG 224760 

AAAAAATTGG CAAAAATATT ATCGTCAAAT TAGTTTTTTG GCAAAAAACT TATAACCTAT 22482 0 

TGAATTTATT ATATTAGCTA ATCAACAAAA AATTTGGGAA TACGATAAAC TTGAAGAGTA 2 24 880 

TCAAATTCCT TATATTCTAT T AAC ATT AG C AGACTGTCCT AACCAAAAAG GAACATATAA 2 24 94 0 

AAATAAACGT CACTGCTGGT TTAGATTTTG GTTTGATGCT T C AATTC AAA GATATGAAGA 22 5000 

TATTTGGAAT AATAGTTCTT TACCAGTATT GTGGAC AAC C T ATT ATT AC C TATCAGGGAA 22 5060 

ACAAAGAAAA CCTTCAGAAA TAAATCTAAA AAACATGAAT TATTTCATCA TGGATAAAAC 22 512 0 

TTTCTTAGAA CAAGAAATAT TACTTCCCCA ATTTATTATT CAAAATATAG AACGGTGGTT 22 518 0 

TAAAACACAC AATTTTGTAT AG AC AAGT C A GCACCTAATA CTATTTCATG ACGAAGATCG 22 524 0 

CCTTTTTCTT TAAAAAGGCA AAAATAATGC TATTAATCAA CCATGCCAGT AATGTCGCCA 22 5 30 0 

CCAACAAATC TATTGGGTAA TGCATGCCTA ATCGCACACG GCTAATCAAC ATCAATAAGC 22 536 0 

CCCACACTGC AATACCCACC ACCAGCAATT TTGCTTTAAA AGAACGGTTA CCCAATAATT 22 5420 

GGGTAAAGCC AACAGCAAGC ATTAGCCAAG TTGCTGCAAA AATCGTATGT CCAGAAGGGA 225480 

ACGAATACCC CGTTTCATTT TCATAATGAT GAACTAACCA AGCTGGTGTA ATGGCATCCA 22 554 0 

TCGAATAAAA ATTCTTAGCT ATTTCAGCAC GCAAAGTGCG GTCATTTTTA TAAAAGTTTT 22 5 6 00 

CAGGCGTTGA ATGAGTTTGC TCAGCTAGAT AAACGGTGAA TGGACGAGGT TCTTCAAATA 2 2 5660 

ATGCTTTTGC CCCTGTTTTT GCGGCTTGAG TTGCAATAAC TGAAATCCCC ATCACAATTA 2 2 572 0 

CGCCTAATAT CCATTGTTTT GGATTTTTAA ATAAAAACGC AAACAAAAGC GTAAACAACA 22 5780 

CACAGGTGAT CAACGCATAA GGTACGCTAC CCGTTTCAGT GAGCAAATAA AGCCAATAAT 22 584 0 

CAGCTTGAGT TAATTGGCTA TTACCATGCC ACTGATAGGA AATACCCCAA ATAAAAAATG 22 5 900 

GTACAAGACA AAGTAATAAC GTATATAATG ACAGCCTTTT GAACATGCTC GCCCCTTGAT 22 596 0 

AACAAAAATA ACGCGCGCAT T CT AAC AG AA ATGACTACGA ATAAGGATCA CTTTATGGCA 22 602 0 

AAAATTCAG C TAGTTGCGCA AG C CAATTT A CCCACTGAAT ACGGTATTTT CAAAATGGTG 226 080 

GGATTTGAAT TCCCAGATAC AAAAAAAGAA CATGTAGCCT TAGTGATGGG CGATATTTCT 2 2614 0 

AATGCTGATG AACCTGTATT AGCACGTATC CATTCTGAAT GTTTAACTGG CGATGCGCTT 2 26 2 00 
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CACAGCTTAA AATGCGAT.G TGGGTTCCAA CTGGCGACAG CATTAAAACA AATTCAAGAA 226260 
GAAGGTCGCG GAGTCTTAAT TTATCATCGT GAAGAAGGTC GCGGAATTGG TTTGATTAAT 226320 
AAAATCCGTG CTTATTCTTT ACAAGATAAA GGAATGGATA CC ATTGAAG C TAATCTTGCT 226380 
TTAGGTTTCA AAGCGGATGA ACGCAACTTT GAAGTCTGTG CCGATATGTT TGAGCTACTT 226440 
GGTGTAAAAA AAGTTCGCTT AATGACCAAC AACCCAGAAA AAGTAGAAAC CATGAAAAAA 226S00 
OCAGGTATCA ACGTGGTTGA ACGAGTACCG CTTAATGTAG GCGAAAACCG CTACAACACA 226S60 
AAATATCTTG ATACAAAAGC GAAAAAAATG GGGCATTACA TTGTGCATAA CAATGACGAA 226620 
CAACAT^AA TGACCTGTCC ACATTGCCAG GAAGAGATTA TTTAACAAAT AAGATATGTT 226680 
TAAAAAGCCA TACTGTTATA TTGAACAGTA TGGCTT^T ATTTAAACTT TATGTTATGG 226740 
AATAATCAAA AGATTTTTTA GCATATTGCT AAAAACGGGA AACCCCGTCT CGTATTCCAA 22 , e00 
AGATCGGGGC TTAT CAGGAG TAAGTAAAAT GAAAAAAATT GCCTTTTTCA TTTTCTTGAT 226860 
TATCTGTTGT TTAGTTAGCT GTTCCGGAAC AGTTATCTAA GATAATCGCA AGGGGACGAT 226920 
G T AGGAGTTG TCCCCAATGC CTCCACCAAT CTTAAAGAAT TCTTTCCTTG ATTGCAAGGG 226980 
CAATTATTGA C^GAACCT TTCTCCACAA ATCTTTACTG TAAATGACAC CGACAGAATT 227040 
TACTTGCG CA CCCArTATAC TCGGTACAAG ATAAACAGG C GTACTATATT GAAAAATAGG 227100 
TAACACTAAA TGCTCTTGTT GAACTATCCC TTTTAATTTC GCGTAATTTT CTAACCGCAC 227160 
TTTTTCGGAT ATCGTCGTCA ITGCACT^C AAATAAACGA TCAAATTCCG CATTTTTATA 227220 
G CCATTTTTG TTATCTGGGC TTTTTGAATA GAATAAATTT AGAAATGCTG CAGGATCATT 227280 
AAAATCAGCA CACCAGCCAG AACGAATTAA TTGAAAATCG CCTTTTGTGC GGGCGGTTTG 227340 
CAATTCTTGC CAGCTCATCC CTTGATTTTC AACTCGTAAT AAATCAGATT GTGATAATTG 22740 0 
ATGATTTAAT CGCATTGCAA TGGTTTGATT CAGAGATAAA TCATCATAAC GAATACGTAA 227460 
TTTCAATGGA CGAGTTTCAC TAATTTGATT TTGGGAAAAG AGCTGTTCTG CAACAACTGG 227520 
CTCCCAAACA GATTCTTGCT CACCTAGCAT AG ATTTTGG C AAAAAAGTAT TGTTAGGATA 227580 
AAGATGAGCA ATATCTGCAA CCAAATTATT GGTAGAAATC ATCGAAACAA TCGCTTTTCT 227640 
TACCGCACTT TTTTGTAACA CAGGATCGGA AAGATTAAAC TCATAGAAGT AAGTACAAAG 227700 
TTGCGGATAA TCTTGGATAT TCTGATTGAC TTTTTTCGGA TTCATTACAA CATCAAAATC 227760 
AGACAAATCT GCATCAACGG AAATTTTTTG ATACTTCACT TGCTGAAATA TCACTTTTTC 227820 
TTTCGCCCAA TAATAAGGAT TGGTAGTTAA AATGTGCTGA TTTTCAGCCT GCCTTTGTAA 227880 
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TTGATATGCA CCATTGCTGA TAAATATTTC AGTCGATTTA GCATAGTGAG GCAAAAGACT 227940 
GACATGTGCC AG CAT AGAAG GTAAATATGG CGATGCTTTA TCTAATTCAA TGCGTAAAGT 228000 
GCGGTCATTT TCGGCAAACA ATCCTAGGGA TTCAACAGGT AACGCTTTTT CCAATACTGC 228060 
CTTCGCATTT TTCAGATTCA TAAAAGCCAA ATAATTTTTA AGTGGGCTTT CAGATTGAGA 228120 
AAGTGTTTGC CAAGATTGTA CAAAATCTGA TG CAGTGATA GGCTCGCCAT TTGACCATTT 228180 
AGCATTTTCA CGCAAAGTGA AAATCCAAGT TTTATTGTCT TCGGTTTGCC AATTTTCTGC 228240 
TACAGCTGGA ACAAGATTAC CCTGCACATC ATAAGCTGTT AATCCTTCAA AAAGATCTCG 228300 
TAATAATTGA GTTTGTTCTT CGCTTTGTGC TTGCCAAGGT TGCAAAGTCA AATCGCTATA 228360 
AACGCCACGT TTTAATTCAA CTCGGTTAGA TTCCAATTGA GTATTTTTTT GTGTCTCAAT 228420 
TTGTGGCGAA ATAGG CTTAG GACTATCCAA TTTATCACAA GCCACCACCG TAAAAATAAT 228480 
AGAAAAAATA ACCGCACTTC GTTGGCACAA AGCGAACAAA TTATTCATTT TCGGTTTTAA 228540 
TCCAAAATAA GAAGAACCAA AACTTAATAC AAATCAACGC AACAATAACA CTACGCCCTA 228600 
TTACACTTGG CGTGAAATAA AAACCAATTA GTAGCATTGT GGTAACAAAA GTCAAAATAA 228660 
AAATTTTGGT ATTTTTCTTT AATGATCGAT TTTCATGAAA GGACTTGAGA TGTTTTTGAT 228720 
AAATACTTGT GCCTAAAAAC CATTGTTCTA AGCGTTTTGA AC CTTTTGTG AAGAAAAATA 228780 
AAGTAAGTAA TAAAAAAGGC GTGGTAGGCA AAAGAGGAAG G AAAAATCC C AATGATGCCA 228840 
ATTGCAAGTG AAAGAAAACC TAAAATAATG TAAATAATTT TCATAATGTT CCTAATAAAA 228900 
AAGGAAAATG CAAACTATTA TTACTGATTT TTTAGATTTT TCAACCTTGA TATTATAGAA 228960 
TTTGCCGTCA CATAAAAAAA CAGTTTTCAA AAAAGGAACA TTTATGTCTA TGTCAAATCC 229020 
ATTACTTAAC ATTCAAGGCC TACCGCCATT TTCACAAATC AAGCCAGAAC ACATTCGACC 229080 
AGCTGTTGAA AAATTAATTC AAGATTGTCG CAATACGATA GAACAAGTGT TAAAACAACC 229140 
GCACTTTACT TGGGAAAATT TCATTTTGCC TTTAACGGAA ACTAATGATC GCTTAAATCG 229200 
TGCTTGGTCG CCCGTTTCTC ATCTCAATTC AGTAAAAAAT AGCACTGAAT TACGTGAAGC 229260 
CTATCAAACG TGCTTGCCGT TACTTTCTGA ATACAGCACT TGGGTTGGTC AGCATAAGGG 223320 
GCTTTATAAT GCCTATTTAG CGTTAAAAAA TAGTGCTGAA TTTGCCGATT ATTCGATTGC 229380 
ACAAAAGAAA GC CATTG AAA ATTCACTGCG TGATTTTGAA CTTTCAGGCA TTGGGCTTTC 2 2 944 0 
CGAAGAGAAA CAACAACGTT ATGGCGAAAT TGTGGCACGC TTATCGGAAT TAAATTCACA 229500 
ATTTAGCAAT AACGTATTAG ATGCCACCAT GGGTTGGGAA AAACTCATTG AAAACGAAGC 22 956 0 
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AGAACTTGCT GGTTTACCTG AATCCG C ACT ACAAGCTGCA CAACAATCTG CCGAAAGTAA 22 9620 

AGGATTAAAG GGCTATCGTT TTACGTTAGA AATCCCAAGT TATTTGCCTG TGATGACTTA 22 9680 

CTGCGAAAAT CGTGCATTGC GTGAAGAAAT GTACCGTGCT TATGCAACAC GTGCCTCTGA 22 974 0 

ACAAGGCCCT AACGCTGGCA AATGGGACAA CAGTAAAGTT ATGGAAGAAA TTCTGACGTT 22 9800 

ACGCGTTGAA CTGGCAAAAT TACTCGGTTT TAATACTTAT ACCGAACTTT CACTTGCCAC 22 9860 

AAAAATGGCA GAAAACCCAC AACAAGTGCT TGATTTCTTA GATCAT CTTG CCGAACGTGC 22 992 0 

CAAACCACAA GGCGAAAAAG AACTGCAAGA ATTGAAAGGT TACTGCGAAA AAGAATTTGG 22 998 0 

TGTCACTGAA CTTGCACCTT GGGATATTGG TTCTTACAGC GAAAAACAAA AACAACATTT 23 004 0 

ATACGCCATT AATGATGAAG AACTTCGCCC ATATTTTCCA GAAAATCGCG TGATTTCAGG 23 0100 

ATTGTTTGAA TTAATTAAAC GCATTTTTAA TATTCGTGCA GTGGAACGCA AAGGTGTGGA 2 3 016 0 

TACTTGGCAT AAAGATGTAC GTTTCTTTGA TTTAATCGAT G AAAATG AT C AACTCCGTGG 2 3 0220 

CAGTTTCTAT CTTGATTTAT ATGCACGCGA ACACAAACGT GGTGGTGCGT GGATGGATGA 2 3 0280 

CTGTATCGGA CGTAAGCGTA AACTGGATGG CAGCATTGAA ACGCCTGTGG CTTATTTAAC 23 0340 

CTGTAACTTC AACGCACCGA TAGGGAATAA AC CAG CGTTA TTTACTCACA ATGAAGTCAC 2 3 0400 

CACGCTATTC CACGAATTTG GGCACGGAAT CCATCATATG CTCACGCAAA TTGATGTATC 23 0460 

CGATGTGGCG GGAATAAATG GCGTGCCTTG GGATGCTGTG GAATTACCAA GTCAATTTAT 2 3 052 0 

GGAAAACTGG TGCTGGGAAG AAGAGGC TTT GGCGTTTATT TCAGGACATT ACGAAACAGG 23 0 5 80 

CGAGCCGTTG CCAAAAGAAA AACTCACGCA ATTGTTAAAA GCGAAAAATT TCCAAGCTGC 2 3 064 0 

GATGTTTATC TTACGTCAGC TTGAATTTGG TATTTTTGAT TTCCGTTTAC ATCACACCTT 2 3 0700 

TGATGCGGAA AAAACCAATC AAATTTTGGA TACACTTAAA TCAGTGAAAT CACAAGTTGC 2 3 076 0 

CGTCATAAAA GGCGTAGATT GGGCAAGAGC GCCACACAGC TTCAGCCATA TTTTTGCAGG 23 0820 

GGGCTACGCC GCAGGTTATT ACAGCTATTT ATGGGCTGAA GTATTATCGG CTGATGCTTA 2 3 0880 

TTCACGCTTT GAAGAAGAAG GTATTTTCAA CCCAATCACC GG CAAATC AT TCTTAGATGA 2 3 094 0 

AATTCTC AC C CGTGGCGGCT CAGAAGAACC AATGGAACTC TTCAAACGCT TCCGTGGCCG 2 31000 

TGAACCGCAA TTAGATGCGT TGTTAAGACA TAAAGGAATT ATGAACTAAT ACGTCAAACT 23106 0 

ATAGGGCATG CACAACATGC CCTTTCTCAA AT ATTTTC CT CGACATAAAA ACATAGTAAA 23112 0 

TTTTAACCGC ACTTTATCTA G AC AAAAC CT GCAAACGAGA CATTATTTTT AC AAAT AG CT 2 3118 0 

GTAAGATAAA CG CAT AGAAA TTCACGCAGA ATAAACTAAT TTCCTGTATA ATTTTTCACA 2 3124 0 
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TATCTATCCA ACTCATTACA CAAAATATTC TGAGGATTTT ATGGCTGCTG CAATTCAACA 2 313 00 

ACGTGCCGAA CTTCAACGCC GTATTTGGCA AATTGCTAAT GATGTGCGAG GCTCGGTCGA 23136 0 

TGG CTGGG AT TTCAAACAAT ATGTGCTTGG CACACTTTTT TACCGTTTTA TTAGCGAAAA 2314 2 0 

TTTTGCCAAT TACATTGAAG CGGGCGATGA AAGCGTAAAT TATGCCCAAT TACCTGATGA 2 314 80 

AATCATTACA CAGATGC C AT TAAAACGAAA GGCTACTTTA TTTACCCAAG CCAATTATTT 23154 0 

AAGAATGTTG CGGCTAATGC TGGCAGCAAT C CTAATTTG A ATACTGATTT AAAACAGATT 2 316 00 

TTTACTGATA TTGAAAACTC AGCGACGGGC TTTCCGTCTG AACAAGATAT TAAAGGGTTA 2 3166 0 

TTTGCCGATT TTGATACCAC CAGCAATCGC TTAGGCAATA C CGTAAAAG A TAAAAACGAC 2 317 2 0 

CGCTTAACGG CTGTTTTGAA AGGCGTGGCT GAACTTGATT TTGGCAAATT TGAAGATAAC 2 317 8 0 

CACATTGATT TATTTGGCGA TGCATACGAA TATCTTATTT CTAACTATGC CGCCAATGCA 2 31840 

GGCAAATCTG GTGGCGAATT TTTTACCCCA CAAAGTGTTT CCAAACTCAT TGCTCAAATT 2 31900 

GCAATGCACG GGCAAACCTC GGTCAATAAA ATTTATGACC CTGCAGCAGG TTCTGGCTCA 23196 0 

CTTTTGCTTC AAGCCAAAAA ACAATTTGAT GAACATATTA TTGAAGAAGG CTTTTTCGGG 23202 0 

CAGGAAATTA ACCATACCAC ATACAACCTT GCCCGTATGA ATATGTTTTT GCATAACATC 23 2 08 0 

AACTACGACA AGTTTGATAT TGCTTTAGGC AACACCTTAA TGGAACCACA ATTTGGCGAT 232140 

AATAAAC CTT TCGATGCCAT TGTTTCGAAC CCGCCTTACT CCGTGAAATG GGCTGGCTCC 2 3 2200 

GACGATCCAA CATTGATTAA TGATGAACGA TTTGCCCCCC GCAGGCGTGC TTGCACCAAA 2 3 2260 

ATCCAAAGCG GACTTTGCCT TTATTTTACA TGCGTTAAGT TATCTTTCAG CAAAAGGCCG 2 32 32 0 

CGCGGCGATT GTTTCCTTCC CTGGTATTTT TTATCGTGGC GGTGCCGAGC AAAAAATTCG 2 32380 

TCAATATTTG GTGGATAATA ACTATGTGGA CGCGGTGATT GCGCTTGCGC CAAATCTCTT 2 3244 0 

TTTTGG C AC C AGTATTGCGG TGAATATTTT GGTGCTTTCC AAACACAAAC CCAATTTATC 2 32 500 

GATGCCAGCG GTTTATTTAA ATCTGCCACT AATAACCACA TTTTAGAAGA GGAACATATC 2 32 56 0 

GAGCAAATCC TCAAACTGTT TGCCGATAAA GAGGATGTGC CGCATTTGGC AAAATCCATA 2 3 2620 

TCCTTTGAAG AAATCGCCCA AAATGACTAC AACCTTGCAG TAAGTTCGTA TGTGGAACAA 2 3 26 60 

AAAGACACTC GTGAAGTGAT TAATATTGAT GAACTCAATG CTCAAATTCG TGaAACTGTT 23 2 74 0 

ACCAATATTG ACCACTTGCG TGCGGAAATT GACAAGATTG TTGCAGAAAT TGAAGGGTAA 2 32800 

AATGTAGGGT GGGCTTCAGC CCAnaTTTAC GATAACAGGT AGTGGGCTAA AGCCCaCCCT 2 32860 

ACGAGGAACA ATAAAAAAAC ACCATGAAAA ACAACCGCAC TTTTTTAGAA AAATTATTGG 2 32920 
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AGGGGTCTGA GGTTGAGTGG AAGCCTTTAG ATGAAGTTGC CAATATCGTC AATAAtGCAA 232980 
GAAAACCAGT CAAATCATCC TTACGAGTGT CAGGGAATAT TC CATATTAT GGTGCTAATA 233040 
ATATCCAAGA TTATGTCGAA GGATATACTC ACGAGGGCGA ATTTGTATTA ATTGCTGAAG 233100 
ATGGTTCTGC TAGTCTAGAA AATTACTCTA TTCAATGGGC TGTAGGTAAA TTTTGGGCAA 233160 
ATAACCACGT CCATGTAGTA AATGGCAAAG AAAAATTAAA TAATCGTTTT TTATACCATT 233220 
ATTTAACTAA TATGAATTTC ATTCCATTCT TAGCTGGAAA GGAACGTGCA AAATTAACAA 233280 
AAGCTAAACT GCAACAAATT CCAATCCCCA TCCCCCCACT CTCCGTCCAA ACCGAAATCG 233340 
TAAAAATTTT GGACGCATTG ACAGCACTTA CCAGCGAGCT TACCAGCGAG CTTACCAGCG 233400 
AGCTTACCAG CGAGCTTATA CTACGCCAGA AACAGTATGA ATACTATCGA GAAAAATTGT 233460 
TAAATATTGA TGAAATGAAC AAGGTTATTG AACTTGGTGA TGTAGGGCCT GTTCGTATGT 233520 
GTAAACGAAT TCTTAAAAAC CAAACAGCAA GCTCTGGAGA TATTCCATTT TATAAGATTG 233580 
GAACTTTTGG GAAAAAGCCT GATGCCTATA TTTCAAATGA G CTTTTCCAA GAATATAAAC 233640 
AAAAATATAG CTATCCTAAA AAAGGAGATA TTCTTATATC TGCTAGCGGA ACTATTGGTA 233700 
GAACAGTTAT ATTTGATGGA GAAAATTCTT ATTTTCAAGA TAGCAATATC GTCTGGATTG 233760 
ATAATGATGA AACTCTCGTT CTTAATAAAT ATTTATATCA TTTCTATAAG ATTG CTAAAT 233820 
GGGGAATTGC AGAAGGAGG C ACCATTCAAC GTTTATATAA CGACAATTTG AAAAAAGTAA 233880 
AAATTTCTAT TCCTCCATTA AAGGAACAAC ACCGCATCGT TTCAATCCTA GATAAATTTG 233940 
AAACCTTAAC CAATTCCATC ACTGAAGGCT TACCTTTAGC GATTGAACAA AGCCAAAAGC 234000 
GGTATGAATA TTACCGAGAA TTGCTATTAA ATTTCTCGTA ACCCGTAGTG GGCTTCAGCC 234060 
CACCAATTAA CCATAACAAA CGGTGGGTTA AAGCTCACCC TACAATGTAA TGAGAATATT 234120 
TTTATGGTGG GCTAAAGCCC ACCCTACGAA CTACAATTTG GAATAATATA TGTCTAATTA 234180 
TCGGCGCGAT TTTACTAAAG GTGGATTATA TTTTTTCACA ATTGTTTTAC AAGATAGAAC 234240 
AAAATCTTAT CTAACTGACT ATATCAATGA ATTTAGATCT GCATATAAAC AAACTTGTGA 234300 
ACATTATCCA TTCGAAACAG TAGCAATTTG TATTTTGCCC GATCATATTC ATTTACTGAT 234360 
GCAATTACCT GAAAATGATG ATAATTACGC AATACGCATC GCATATTTAA AAACACAATT 234420 
TACACGACAA CTTCCAAAAG AATGCCGACA ATTTAATAAA AATAGACAAA AATATCGAGA 234480 
ATCAGGTATT TGGCAACGCC GATTTTGGGA G C ATTTAATT CGTGATGATA AAGATTTAGC 234540 
GAATCATTTA GATTATATTT ATTACAATCC TGTGAAACAC GGCTATGTTG AGGTAGTAAA 234600 
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AGATTGGCCG TATTCTTCCT TCCATCGTGA TGTGAAAGCA GAGATTTATC CTGAAGATTG 2 3 4660 
GGGAGGCAAC CCAGATTTGA AAATTAAAGG TGATATATAA CAATACTGTA GGGTGGGCTT 2 34 72 0 
CAGCCCACAA TTAACCATAA CAAACGGTGG GCTAAAGcCC ACCCTACAAT GTAATGAGAA 2 34 78 0 
TATTTTTATG ATGGGCTAAA GCCCACCCTA CGAACTGAAT AACTGAAAAA TATACTAAGG 23484 0 
AAAAACCATG ACCCAATACA AAACTATCGC TGAATCCAAT AATTTTATCG TTTTAGATCA 2 34 900 
ATATAATAAA TTTG TGGAAG AATCTAATGC TGGTTATCAA ACGGAAAGGA GCCTTGAGCG 2 34 96 0 
TGAGTTTATT CGTGATTTAC AGGCTCAAGG CTATGAGTAT TTACAATGGC TTAATAATCA 2 3 502 0 
CGATGAACTG ATTAAAAACT TACGGGCGCA ATTACAACGC TTAAATAACG TGGTTTTCTC 2 3 508 0 

CGATGCAGAA TGG CAACGTT TTTTAGAGGA ATATTTGGAT AAACCGAGCG ATAATCTGAT 2 3 5140 

TGAGAAAACC CGCAAAATTC ACGATGATTA TATTTATGAT TTTGTGTTCG ATAACGGACG 2 3 5200 

CATTCAGAAC ATCTATTTGC TTGATAAGAA AAATCTTGCC AATAATTCTC TGCAAGTCAT 2 3 5260 

CAATCAATTT AAGCAAACTG GCAGCTATGA TAATCGTTAT GATGTGACAA TTTTGGTGAA 2 3 332 0 

TGGTTTACCC CTTTATTGAA TTAAAAAAAC GCGGCGTGGC GATTCGTGAA GCCTTTAACC 2 35380 

AAATTCACCG TTACAGCAAA GAAAGTTTCA ATAAAGaAAA TTCTCTCTTT AAATATATTC 2 3 544 0 

AGATTTTTGT CATTTCTAAT GGCACGGATA CTCGCTATTT TGCTAATACG ACTAAACGCA 2 3 550 0 

ATAAGAATAG CTACGACTTC ACAATGAATT GGGCAACGGC AAAAAATACT CTGATTAAAG 235560 

ATTTAAAGGA TTTTACCGCG ACTTTCTTGC AAAAGAATAC TTTGCTCAAT GTGTTGGTAA 2 3 5620 

ATTACTGCGT GTTTGATGTG AGTGATACGT TGTTAATTAT GCGTCCGTAT CAAATTGCCG 2 3S68 0 

CAACAGAACG TATTTTATGG AAAATtCAAA TTTCTTACTT AGCAAAAAAT TGGAGTAATC 2 3 574 0 

GTGAAAGTGG TGGCTATATT TGGCATACCA CAGGTTCAGG CAAAACCCTC ACCAGTTTTA 23 5800 

AAGCCTCTCG CCTTGCGACT GAACTTGATT TTATTGATAA AGTCTTTTTT GTGGTCGATC 2 3 5860 

GTAAAGACTT AGACTACCAA ACGATGAAAG AATATCAGCG TTTTTCGCCT GATAGCGTGA 23592 0 

ATGGGTCGGA AAGTACCGCT GGGCTTAAAC GCAATATTGA AAAAGATGAT AACAAAATTA 2 3 598 0 

TCGTAACCAC CATTCAAAAA TTGAATAATT TAATGAAAAG TGAAGAAAAC CTGTCTATTT 23604 0 

ATCAAAAACA GGTGGTCTTT ATTTTCGATG AAGCACATCG CTCTCAATTT GGCGAAGCAC 2 36100 

AAAAAAATCT AAAACGTAAA TTCAAAAAAT TCTATCAATT TGGTTTTACT GGCACGCCTA 236160 

TTTTCCCTGA AAACGCATTA GGTGCGGAAA CG AC AG CAAG TGTGTTCGGT GCGGAATTGC 2 3 6220 

ATTCTTATGT GATTACCGAT GCTATTCGTG ATGACAAAGT ACTGAAATTC AAAGTCGATT 2 362 80 
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ACAACGATGT CCGCCCACAA TTTAAAGCCT TAGAAACAGA AAAAGATCCT GAAAAATTGA 2 36 34 0 

CCGCACTTGA GCAGAAACAA G C CTTTTT AC ACCCTGAGCG TATTAAAGAA ATCTCGCAAT 2364 00 

ATTTGCTTAA CAATTTTAAA CAGAAAACCC ACCGCTTGAA TGCCACAGGT AAAGGCTTTA 236460 

ATGCAATGTT TGCGGTAAGC AGTGTAGAAG C AG C AAAACG TTATTACGAA AC CTT AC AAA 236 520 

ATTTACAGGC AGAGCAGGAA TATCCGTTAA AAATTGCAAC AATCTTTTCG TTTGCCGCCA 236 58 0 

ATGAAGAACA AGATGCGATT GGCGATATTC CAGATGAGAC CTTTGAACCC ACTGCTCTAA 236640 

ACAGTACGGC AAAAGAATTT TTAACAAAAG CCATTGATGA TTACAATCAC TACTTTGGCA 2 3 6700 

CGAATTATGG CGTAGATAGT CAATCATTCC AAAATTACTA TCGCGATCTT GCCAAACGTG 236760 

TGAAAAATCA GGAAGTGGAT TTACTGATTG TGGTCGGAAT GTTCTTAACA GGCTTTGATG 236 820 

CCCCTACGTT GAATACTCTT TTCGTGGACA AAAACTTGCG TTATCACGGA TTAATG CAGG 236 880 

CTTTTTCTCG CACCAACCGC ATTTATGACA CAACTAAAAC CTTTGGGAAT ATCGTCACTT 236940 

TCCGCGATTT GGAACAAAAT ACCATTGATG CAATCACGCT ATTTGGCGAT AAAAATACGA 2 3 7000 

AGAATGTCGT ACTAGAAAAA AGCTACGACA GCTATTTTAA CGGTGATGAC AATCAACGTG 23 7060 

GCTATGCAGA AATCGTGAAA GAATTAAAAG AAAGCTTTCC TGATCCAACA GAAATCGAAA 2 3712 0 

CAGAGCAAGA TAAAAAAGAG TTTGTAAAAT TATTTGGCGA ATATTTGCGT GTCGAAAATA 23 718 0 

TTTTGCAAAA TTACGATGAG TTTGCAGCAT TGCAAGCCTT ACAAGCGGTC GATTTAAATG 2 37240 

ATCCGATTGC AATGGAAAAA TTCAAGCAAG TACATTATGT GAATGATGAA CAAATTG CAG 23 7300 

AAATGCTGAA AGTACCCACT TTACCAGTAA GAGCGGAGCA AGATTATCGT TCAACTTATA 2 3 7360 

ATGATATTCG CGATTGGTTA CGCCAAAGAA AAGAGGGGAA TGATAAGGAT AATTCGCCGA 2 3 7420 

TAAATTGGGA TGATGTCGTG TTTGAAGTGG ATTTATTAAA ATCTCAAGAA ATCAATTTGG 2 3 7480 

ATT ATATT CT TGCGTTAATT TTCGAACATC ATAAGAAAAA C C AAG AC AAA GAGGTGTTAA 2 37 54 0 

TTGATGAAAT CCGCCGCACA GTTCGTTCAA GTTTGGGTAA CCGTGCGAAA GAGAGCTTGA 2 3 7600 

TTGTCGATTT TATCAACCAA ACAAATTTAG ATGATATTCC CGATAAAGCG ACTTTAATTG 2 3 7660 

ACTCATTCTT CCTATTTGCT CAAGCAGAAC AGCGAAAAGA AGCAGAATCC TTAATTCAAG 2 3 7720 

AAGAAAATTT GAATGTCGAT G C AG CAAAAC GCTATATCAG CACTTCATTA AAACGGGAAT 23778 0 

ATGCCAGTGA AAACGGCACG GCACTTAATG AAGTATTGCC AAAAATGAGT CTACTTAAGC 2 3 784 0 
CACAATATCT CACTAAAAAG CAGAAGATTT TCCAAAAAAT TGCTGCATTT GTAGAGAAAT 237 900 
TTAAAGGGGT TGGAGGAAAG ATTTAAATTT AATATATCAA CAATTAGACA TCTTATTAAG 237 96 0 
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ACATAAAGGC ATTATGAATT ATGAATTAAT AATACAAAAC ATACTGGTCA ATATTTAACT 2 38 02 0 

AGTCGTGATA ATAAAAACAG CTCCAATTTA TACTTGGAGC TGTTTTTTAT ATCGATAAAC 238 08 0 

TTACTTTTTA CGTGCTAAAA TTCGATTTGC ACACTCTGCT GCAGCTGTTA AGCCACCTGC 23814 0 

CATCATAATA ATATCTTTAA TCACTAAACG ACCAACTCCA GATAAATAAG GG AAAC CAT A 2 3 8200 

GGCTGGCGTT GGTAAATCTC CACCTAGATT AGGCACCCAA GCCTCGGGCG TTGTGATTAA 2 3 826 0 

AAATG AC AG C GTGACTATTG ACAT G C C AAA TGTCAGTAGG CCTCCTGCCA AACCTGCCGT 2 33 320 

TGCATTCCAA ATACCAGCTA AAGTAAGAAT ACCAACAGTA ACAATAATTG CTCCAATAAT 2383 8 0 

GTAAGAAGCC GTATAGGTGC CATTTTCTTT ATGCCATTCG ATGTTTTTTG C AAC CAT TTT 2 3 8440 

TCCTTCAGGA TTTTTGTGTA GTGTATATTC CATCACTAAC TCTCCTTTAT CATTCGGCAC 2 3850 0 

TAAATTTGGC C CTTTTTC AT ACATATAGCT AAAAAACGGG CT ATT TG AT A CAAAGTGAGC 2 3 856 0 

AATTCCATCT GCCTCATATT GACAAACTTT CAAACCACCG ATCCATGCCA TTACAATAAA 2 3 8620 

AATAGCAATA CGGATAAAAT TAATAAATTG ACGCTGCATT GGTGCAACAA TTCTCGCGAC 23 86 8 0 

AAATT CAAC A AATGTATTAA ATACTGACAT ATAAATTTCC ACATAAAAGT AAATAAAATT 2 3 8740 

GAGAGCCTAA TTCTACTCGC TTTAATATTA AAAAGCAAAA TCAATTCCCT TTAATAAAAG 23 8800 

ATAAATAACG GAGAAATTTG AAAAGTTTTA CTAATTAAAA TAAAATTTGA TTTTCCCATT 2 3 8860 

TTAAGCCTCT TTATTATCTT TAAATCACAT GTAAATAATC TTTACACAAC TACCTTATAA 23 8 92 0 

TATTAAAATT TACAAAATTT AAGCAAATCC TCCACATCTT CTAACCAAAG ATTTCCAATT 2 3 998 0 

TTATCAATCA ATTCTCCACA ATTAAAAACA TCAGAATTTT TAG C CG CACT TTCTAAAGTA 2 3 904 0 

TCCGCTAATT GACGTAAATG ATTTAAC C CT ACCGATGCCG CTGCGCCTTT AATTTTATGG 2 3 910 0 

GCGGTATCTT CTGTTTCTTT CATATTATCT TTTGATAATT GAGC CAGAT A ATTTGGCATA 2 3 916 0 

GTTTGTTTGA ATAAAGATAA ATTTTCTAAA ATTTGGGATT TTCCAAGTGT TTCTATCAGC 2 3 922 0 

GCTAAATCAA ACTGTTCTGA AAGCTCATTA TCATCATTCA TTTCTATAAT AG ATTCG CTT 2 3 9280 

TCATCCGCAA AAAATTGCTG TAGACAATGG TGTAAATCCT TAATTGAAAT GGGTTTACGT 2 3 934 0 

AATACG C CAT CCATGCCCAT TTCTAAATAT TCTTGCTCGC TTTG CAT AAC ATTGG CGGTA 2 3 9400 

AAAGCAATCA TTGGCGGTAA AAAATCATAA ATACCGTTTT CATAGTTCTC TCGCAAATAT 2 3 9460 

TGGGCAATTT CAAAACCACT CATATCTGGT AATTTAATAT CGAGCAATAG AATGTCATAA 23 952 0 

ACATTTTTCT CAAATAAAGT AATCGCCTGC TTGCCATTTG TCGCAACATC AACATGATGC 2 3 958 0 

CCAAGTTTTT CTAAAATAGT TTTTGCAACC ATAATATTGA GATCAACATC TTC C AC T AAA 239640 
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AGAGTATTCA AGGATGGTAT ATTTTnCTTA ACAGATTCTT TTTCAGAAAT TTCCTGAGCG 23 9700 

TGGAGAGTnA GATGGAAAGT CGCGCCTTTT CCTCGTTCAC TTTCAACTGT TAAATCCCCT 2 3 9760 

TCCCATTAAC TGAGCAAGAT TTTTAGAAAT AGCCAACCCA ATACCGCTAC CCGCACTTTG 2 3 9820 

CTGGCGGCTT TCTTTCACTT GATAATACAT TTCAAAAATA TGTTTTTGTT CTTCAGGTGA 2 3 9880 

AATCCCCATT CCTGTATCTT TCACAATAAA ATGGTAACAA TCCTGATTTC TCATAATTTT 2 3 994 0 

AAGAATAATA TTTCCCTGAT CCGTAAACTT CACCGCATTA CTAATTAAGT TCCACAAAAT 240000 

TTGGCTCAAG CGAACACGAT CAAGATTCAA CCAATTAGGC AAATTAGGTT CAAGCTCTAA 24 0060 

AGAAAAAATA AGATTTTTTT CTTTGGCGAG GAAACTAGCA AAATTATAAA TATCGTTTAA 240120 

TAAGGCAGGG AAATCTGTTG GCTGGCGATT TAATTCAATT CGGCTGGCAT CAATTTTTTC 24 0180 

CAAATCAATA AT ATCG CTAA AAATATAGCC TAAACTGACC GcACTTATGT TGATAGTTTT 24 0240 

TAAATAATTA CGCTGGAGAT CATCCAATTC ATCATCAAGT AAAATCTGGC TAAGCCCGAC 24 03 00 

AATCCCATTC AATGGCGTGC GAAATTCATG GCTTATTGTG GCAAGTAAGG TACTTTTATC 2 4 0360 

ACGGCTATTT TTTTCTAACT TTTCCAATGC TTGGGATAAT TTTTTCTGAG CAGCAATTCG 24 0420 

CTCGGCGACC TCATTTTCCA ATCGATTAAC CGAGCTAGAA AGATCAAGAC GAGAATGTTC 24 0480 

AAGTTTTTCT ACTAATAAAG TGAAAAAATA AATAACAAAA GGCGCAGTGA GTAAGCCAAA 24 054 0 

AGTTACCGAA CGAAAAATAT CTACCCAAGA TATCTGATGA ACAATAAATA GACTAAATAA 240600 

AATCTGAGTA CAAAGAGCTA AAACCGCGAG AATCATCACG CCTAAAAGAG AAAAACGAAG 24 0660 

ACGCCCAAGA CGAATAACCC AATCCACATA ACTCTGAGCA AAATATTTGA AATTTTTCAT 24 072 0 

CAAATCTCCT AAAACATTAT TATCGTTATA AAAAAAACCT GTGCATATAG GCTTTATTAT 24 078 0 

ATCCATAAAG TAAACGAGAT TGTTATTACT ATGTATGCGT AAGAAATGAA AAACTTAATC 24 084 0 

TTTATTAATT ATCTGTTTGA CTAAAATTTG TAATTCAGGA ATAACTTGCT CTTCAAACCA 24 0900 

AGGATTCTTT GTTACCCAAA GCTGATTACG TGGGGAAGGG TGCACGAGAG GCATAAAGTG 24 096 0 

CGGTAAGAAT TGACGATAAT TTTTAACAGT GTTAGTTACA TTATCTTTAT TTTCGGGTAA 241020 

ATAATATTTC TGTGCATATT GCCCAATAAG AATAGTGAGC TGAATATTAG GTAGATTCCC 2 4108 0 

TAAAATCATT GGATGCCATC TTTCTGCAAA ACCTTGACGA GGTGGTAAAT CACCCGATTT 241140 

AC CTTTTCC A GGATAATAGA AATC CATTGG AAGAACCGCA AATATCCCAG AATTATAAAA 241200 

ATAATCGTAA TCTACTCCCA ACCATTCTCG CAGACGATCA CCGCTTTTAT CATTCCAATA 24126 0 

CAAACGAGAT TGCTCAGCTT TTAACCCAGG AGCTTGACCA ACAATATTAA TTCGAGCTGT 241320 
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TTTAGGAGCC GAAAATAAAG GGAAAATTCC ACGTTCTGTA AAATCTTTAT TTTGAGGATC 2 413 8 0 

AGCGATAATA CTTGAAGTTA TTTCGTCTAA ATTTTTAAGC ATAATGAAGG AGATTAGAAA 24144 0 

TGAATATATA AAGTCTATTT GAAAAATATG AAAAATCAAA ACTCTAAATG ATGTAAATAG 2 415 00 

GTATATTTAT TAAAACTATA TTGTAGGCAA TACAAAACTA AAAACTTGAT ATTGAAATAC 241560 

AATCTCTATT TTCTACTTTA ATCCTTTTCA TGAAATAATG AATAAATTCC CAAAAGCTAA 2416 2 0 

AACCCCATAT CTTTCGACAC GGGGCCCTTC TGTTTTTATT TAACTTAGAT AAGTATTTGG 2416 8 0 

TAACCTGGCG GTGCCCTACT CTCACATGGG GAAACCCCAC ACTACCATCG GCATTACAGC 24174 0 

GTTTCACTTC TGAGTTCGGT ATGGTCTCAG GTGGGTCCAC CGCACTATCG CCGCCAGGAT 2418 00 

AATTCTTTGA TAACTCGTCT TTTC TCTTTT ATCTCTCGTC TCTTTTATCT CTCGTTACTC 2418 6 0 

AGTCTTTCCT CTACCTTTCA CTCGTTCTTA TTGGTCACAA GCTGAATCCG ATTTTCTTTC 24192 0 

TATTAAGTCT TCTATATTTA CTTTTCTTTG TTTAGTCTTT TACTTCTCTT TCGCTCTGCT 24198 0 

TTTATTTGCT TCGCTTTTCA TCTCTTACTT CACTTAGCTC CCAAAAACAC TTGAGCGTTG 24 2 04 0 

TATAGTTAAG CCTCTCGGGC AATTAGTATT GGTTAGCTCA ATGACTCACA TCACTTACAC 24210 0 

ACCCAACCTA TCTACGTCGT AGTCTACAAC AACCCTTACT GACTTAAAGT CAGGGATGAC 2 4216 0 

TCATCTCTTG GCAAGTTTCG TGCTTAGATG C TTTC AG C AC TTATCTCTTC CGCATTTAGC 242220 

TACCCGGCAA TGCGTCTGGC GACACAACCG GAACACCAGT GATGCGTCCA CTCCGGTCCT 2 4 2280 

CTCGTACTAG GAGCAGCCCC AATCAATCAT CCTACGCCCA CGGCAGATAG GGACCGAACT 2423 4 0 

GTCTCACGAC GTTCTAAACC CAGCTCGCGT ACCACTTTAA ATGGCGAACA GCCATACCCT 24 24 0 0 

TGGGACCTAC TTCAGCCCCA GGATGTGATG AG CCG ACATC GAGGTGCCAA ACACCGCCGT 24 24 6 0 

CGATATGAAC TCTTGGGCGG TATCAGCCTG TTATCCCCGG AGTACCTTTT ATCCGTTGAG 24 252 0 

CGATGGCCCT TCCATTCAGA ACCACCGGAT CACTATGACC TACTTTCGTA CCTGCTCGAC 242 58 0 

TTGTCTGTCT CGCAGTTAAG CTTG CTTATA C C ATTG C ACT AACCTCACGA TGTCCGACCG 2 4 2640 

TGATTAGCAA ACCTTCGTGC TCCTCCGTTA CGCTTTGGGA GGAGACCGCC CCAGTCAAAC 242 7 00 

TACCCACCAG ACACTGTCCG AG AC C ACGTT TCGTCATCTT CGTTAGAACA TCAAACGTTA 242 76 0 

AAGGGTGGTA TTTCAAGGTC GCCTCCACAA TGACTGGCGT CACTGCTTCA AAGGCTCCCA 2 4282 0 

CCTATCCTAC ACATCAAAAT TCAATGTTCA GTGTCAAGCT ATAGTAAAGG TTCACGGGGT 2 42 880 

CTTTCCGTCT AGCCGCGGGT ACACCGCATC TTCACGGCGA TTTCAATTTC ACTGAGTCTC 24 2 94 0 

GGGTGGAGAC AGCCTGGCCA TCATTATGCC ATTCGTGCAG GTCGGAACTT ACCCGACAAG 24 3 0 00 
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GAATTTCGCT AC CTTAGG AC CGTTATAGTT ACGGCCGCCG TTTACTGGGG CTTCGATCAG 243060 
GTGCTTCTCT TTCGATGACA CCATCAATTA ACCTTCCAGC ACCGGGCAGG CATCACACCC 243120 
TATACGTCCA CTTTCGTGTT TGCAGAGTGC TGTGTTTTTA ATAAACAGTT GCAGCCAGCT 243180 
GGTATCTTCG ACCGGTTCAA CCTTCAGGGG CTAGCCCTTA CAATCTACG C CGGCGCACCT 243240 
TCTCCCGAAG TTACGGTGCT ATTTTGCCTA GTTCCTTCAC CCGAGTTCTC TCAAGCGCCT 243300 
GAGTATTCTC TACCTGACCA CCTGTGTCGG TTTTCAGTAC GGTTTAGTAA AGCCTTTCGC 243360 
TTAGTGGCTT TTCCTGGAAG TGTGGTATCA GTTACTTCAG CTCCGTAGAG CCTCGTCATC 243420 
ATCTCTCTGT GTTAAGGAAG TCCGGATTTG CCTAAACTTC CCACCTACCA ACTTAAACGC 243460 
ACATATCCAA CAGTGCGATA ACCTAACCTA CTCCGTCCCC ACATCGCAGC TTTACCAAGT 243540 
ACAGGAATAT TAACCTGTTT CCCATCGACT ACGCTTTTCA GCCTCGCCTT AGGAGCCGAC 243600 
TCACCCTGCC CCGATTAACG TTGGACAGGA ACCCTTGGTC TTCCGGCGAA CGGGTTTTTC 243660 
AC CCGTTTTA TCGTTACTTA TGTCAGCATT CGCACTTGTG ATACGTCCAG CCCACCTCTC 243720 
GATGAACCTT CTTCCGCTTA CACAACGCTC CCCTACCCAA CAGAATAAAT TCTGATGCCG 243780 
CAGCTTCGGT G CT ATATTTC AGCCCCGTTA CATCTTCCGC GCAGGCCGAC TCGACTAGTG 243840 
AGCTATTACG CTTTCTTTAA ATGATGGCTG CTTCTAAGCC AACATCCTAG CTGTCTAAGC 243900 
CTTCCCACTT CGTTTCCCAC TTAATATAGA CTTGGGGACC TTAGCTGGCG GTCTGGGTTG 243960 
TTTCCCTCTC CACGACGGAC GTTAGCACCC GCCGTGTGTC TCCTGAGTAT CACTCTTTGG 244020 
TATTCGTAGT TTGCATCGGG TTGGTAAGCC GGGATGGCCC CCTAGCCGAA ACAGTGCTCT 244080 
ACCCCCAAAG GTGTCACCTC AAGGCTCTAC CTAAATAGAT TTCGGGGAGA ACCAGCTATC 244140 
TCCCGGTTTG ATTGGCCTTT CACCCCCAGC CACAAGTCAT CCGCTAATTT TTCAACATTA 244200 
GTCGGTTCGG TCCTCCAGTT AGTGTTACCC AACCTTCAAC CTGCCCATGG CTAGATCACC 244260 
GGGTTTCGGG TCTATACCTT GCAACTATTC GCCCAGTTAA GACTCGGTTT CCCTTCGGCT 244320 
CCCCTATTCG GTTAACCTCG CTACAAAATA TAAGTCGCTG ACCCATTATA CAAAAGGTAC 244380 
G CAGTCACAA GGTTGCCCTC GCTCCCACTG CTTGTACGTA CAAGGTTTCA GGTTCTATTT 244440 
CACTCCCCTC ACCGGGGTTC TTTTCGCCTT TCCTTCACAG TACTGGTTCA CTATCGGTCA 244500 
ATCAGGAGTA TTTAGCCTTG GAGGATGGGC CCCCCTTCTT CAAACAGGAT ATCACGTGTC 244560 
CCGCCCTACT TCTCGTTAGC TTAGTACCAC AATATGGTTT TTAGATACGG GATTATCACC 244620 
CTCTTTGATT GTGCTTCCCA ACACATTCTC CTCACTATAT TGCTATCACT TACTGGCTCT 244680 
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TTCGCTTTCG CTCGCCGCTA CTGACGAAAT CTCGGTTGAT TTCTTTTCCT CGGGGTACTT 244 74 0 
AGATGTTTCA GTTCTCCCGG TTTGCCTCAA CAAGCTATGA ATTCACTTGT TGATAGTAGA 244800 
TTCTTCATCT ACTGGGTTTC CCCATTCGGA TATCTTGGAT TAAACGCCTC TTATCGACTC 244860 
ATCCAAGCTT TTCGCAGATT AGCACGTCCT TCTTCGCCTC TGATTGCCAA GGCATCCACC 244920 
TTGTACGCTT AGTCACTTAA CTATACAACC TCAAATGTTT TCAATCACTT TACGTTTTCA 244 980 
CTTCAAAGTG CGGTCAATTT GATGCGTATT TTCATTCAAC TAAACACTTG ATTGCTTTTG 24 5040 
TTCAATCAAG ATTTTTTTAC AAAACAGGTT TTGTTTTTAG AGTTTCCTTT TCAGTTCTCT 24S100 
CTTTCACTTT ACTGTTTTGC CTACTCAGAC TTTCTCTCGA AAATCTCTTG TTTTC AG CTT 24 5160 
GTTTCCAATT TTTTAAAGAA CAGTAAGACA ATCTAATAAA GTTATCTTTA ATTGGCGTCC 24 5220 
CCACGGGGAT TCGAACCCCG GTTACCG CCG TGAAAGGGCG ATGTCCTAGG CCTCTAGACG 24 5280 
ATGGGGACAT CAATTAAAGA TACTCTATTT TCTTTTTATC TTTTAACGtT TATTTTTCAA 24 53 40 
CGTTGCTTTC GCTTCGTTTT TAATCTTGTC TACAACTATC AGCCAATCTG TGTGAGCACT 24 54 00 
CGCTGTCTCT CGTCTTCAGT AAGGAGGTGA TCCAACCGCA GGTTCCCCTA CGGTTACCTT 24 5460 
GTTACGACTT CACCCCAGTC ATGAATCATA CCGTGGTAAA CGCCCTCCAA AAGGTTAAGC 245520 
TATCTACTTC TGGTACAACC CACTCCCATG GTGTGACGGG CGGTGTGTAC AAGGCCCGGG 245580 
AACGTATTCA CCGCGACATT CTGATTCGCG ATTACTAGCG ATTCCGACTT CATGGAGTCG 24 564 0 
AGTTGCAGAC TCCAATCCGG ACTTAGACGT ACTTTATGAG ATTCGCTCCA CCTCGCAGCT 24 5700 
TCGCTTCCCT CTGTATACGC CATTGTAGCA CGTGTGTAGC CCTACTCGTA AGGGCCATGA 245760 
TGACTTGACG TCATCCCCAC CTTCCTCCAG TTTATCACTG GCAGTCTCCT TTGAGTTCCC 245820 
GACCAAGTCG CTGGCAACAA AGGATAAGGG TTGCGCTCGT TGCGGGACTT AACCCAACAT 2 4 5880 
TTCACAACAC GAGCTGACGA CAGCCATG C A GCACCTGTCT CTAAGTTCCC GAAGGCACAA 24 594 0 
GCTCATCTCT GAGCTCTTCT TAGGATGTCA AGAGTAGGTA AGGTTCTTCG CGTTGCATCG 24 6000 
AATTAAACCA CATGCTCCAC CGCTTGTGCG GGCCCCCGTC AATTCATTTG AGTTTTAACC 246060 
TTGCGGCCGT ACTCCCCAGG CGGTCGATTT ATCACGTTAG CTACGGGCAC CAAGCTCTAA 24612 0 
GCCCAATCCC CAAATCGACA GCGTTTACAG CGTGGACTAC CAGGGTATCT AATCCTGTTT 246180 
GCTCCCCACG CTTTCGCACA TGAGCGTCAG TACATTCCCA AGGGGCTGCC TTCGCCTTCG 246240 
GTATTCCTCC ACATCTCTAC GCATTTCACC GCTACACGTG GAATTCTACC CCTCCCTAAA 246300 
GTACTCTAGT TACCCAGTCT GAAATGCAAT TCCTAGGTTA AGCCCAGGGC TTTCACACCT 2 46 360 

-77.148- 

SUBSTTTUTE SHEET (RULE 26) 



BNSDOCID: <WO 9633276A1 J > 



WO 96/33276 

CACTTAAATA ACCGCCTGCG 
TCCGTATTAC CGCGGCTGCT 
ATTTGATGTA CTATTAACAC 
GGCCTTCTTC ATTCACGCGG 
CACTGCTGCC TCCCGTAGGA 
CTCAGACCAG CTAGAGATCG 
ACTTGGGCTC ATCCTATGGC 
CGCGGTATTA GCGACAGTTT 
TACTCACCCG TCCGCCACTC 
TGCATGTGTT AAGCCTGCCG 
AGTTCAATCG CTCAATAAAC 
TGAATTTTCA GTTAAGCACC 
AATCAACAAG TGCCCACACA 
CACTTGGAAA CTTAAAAAAC 
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TGCCCTTTAC GCCCAGTTAT TCCGATTAAC GCTCGCACCC 2 46420 

GGCACGGAGT TAGCCGGTGC TTCTTCTGTA TTTAACGTCA 246480 

ATCAAC CTTC CTCAATACCG AAAGAACTTT ACAACCCGAA 246 54 0 

CATGGCTGCG TCAGGGTTCC CCCCATTGCG CAATATTCCC 2 4 6600 

GTCTGGACCG TGTCTCAGTT CCAGTGTGGC TGGTCATCCT 24 6660 

CAGGCTTGGT AGG CATTTAC CCCACCAACT ACCTAATCCC 2 4 6720 

ATGCGGCCTC TCAGTCCCGC ACTTTCATCT TCCGATAATA 246 78 0 

CCCGTCGTTA TCCCCCTCCA TAAGCCAGAT TCCCAAGCAT 24 684 0 

GTCAG CAAGA AAGCAAGCTT TCTCCTGCTA CCGTTCGACT 24 6 900 

CCAGCGTTCA ATCTG AG CCA TGATCAAACT CTTCAATTCA 246 960 

TGCTTAGCTA TAAATAAAAC ACTACTTAAA AAAGTAATGA 2 4 702 0 

TATTAAGACT TCAAAATTAA AAATAGTTTT AAAACAAGTC 24 708 0 

GATTGTCTGA TATATTGTTA AAGAGCAAAA AATAACGACG 2 4 714 0 

TTCACAACAG TGCGTCGTTG TGTGCGGTGC ATTATAGAGA 2 4 7200 

ACTTTTTTGT AAAAATATTT AAAAAAATGA TCTTTTGATG 24 7260 

AAAAGTGCGG TTAGAATAGT CAAACTTTTA TTGCTATTAT 2 4 732 0 

TCCACCATTA AACATCTGAA TTTTATTGCC ACATAATACT 2473 80 

TATGC CTTGT TCAATAC CTG AAATAACGCC TTGCTCAGTA 2 4 7440 

TGAAAACGCA TTATATGATT GCCATGCTTG TTGAAATTCC 2 4 75 00 

AAAGATATTT AAACGTGTAT ATAAATGTTG TATAAGTTTG 24 7 560 

TACATCAGGA TCAATTTCAC ACACTTCCGC ATAAGG CTGA 2 4 7620 

CAACGACACA TTAATACCTA TGCCAATCAC T AAATTG AG C 24 7680 

AATTTCAACT AAAATGC CAC CTAACTTTCT TTCATCAAAC 2 4 774 0 

CACTTGTACA TTTAGCACTT CCGCAATCGC CAAACCGATG 24 7 8 00 

TGATTTTTTA GGATCAAATG CCCAATAAAA ACTAAACATG 2 4 7860 

TAACCATTGG CGACCACGCC GACCTCGTCC TGCAGTTTGA 247 92 0 

GCCTTTTTTT AATGAAAGAA TATTTTGTAG TATCCATTCA 24 7 980 

CTGATAATGG ATACTATAAG GAAAAAGTGC GGTTGAAATT 24804 0 
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TGCTGAGGAT TTAATAAAGG AAGCATTGGC ACTAAGCGAT AATCTTGTCC ATCAACGAGA 248100 
ATATCCAAAC CAATTTCACG TAATTGTTGA ATATCTTCCT CTAGATTTTT AGAAAATTTT 248160 
TCCAATTCAG ACCGCACTTT CGGCTGACAA TCCGCTAAAT ATGTCAATAA CGTAAAATTC 248220 
ATCATTCATT CCTTATGGCA AATAGTATGA AAATCATAGC ATAAATCCTA AAATTnnGTT 248280 
TTCAAAATAA AGTGAAATCT GTATAATCTC GTCGCAATAT TTTTTAACTT CAACTTATTA 248 34 0 
GCGAGAATAT TGCAATGTCA TTACGCATTA AACAAGAnGC CCTTACTTTT GACGATGTTC 248400 
TACTCGTCCC AG CTCACTCA ACTGTGCTTC CAAATACTGC CAACCTTTCA ACTCAACTTA 248460 
CTAAAGAAAT TcGTTTAAAT ATCCCAATGC TTTCAGCAGC AATGGaTACC GTAACTGAAA 248520 
CCAAACTTGC GATCTCTTTA GCACAAGAAG GCGGTATTGG CTTTATTCAT AAAAATATGA 248580 
CGATCGAACG CCAAGCAGAT CGCGTGCGTA AAGTAAAAAA ATTCGAAAGT GGTATCGTTT 248640 
CTGAACCAGT AACTGTTTTA CCAAATTTAA CGCTTGCCGA ACTTGCTGAA ATGGTGAAGA 248700 
AAAACGGATT TGCGGGCTAC CCTGTTGTAG ATGGGGAAAA TAATTTAATC GGTATCATTA 248760 
CTGGCCGTGA TACACGTTTC GTAAAAGATT TAAGCAAAAC AGTTTCACAA GTAATGACGA 248820 
AAAAAGAAGA CTTGGTGACC GTAAAAGAAG GGGCGAGTCG CGAAGAAATT TTGGaATTAA 248880 
TGCATCAACA CCGTGTTGAA AAAGTATTAG TGGTTAATGA TAGTTTCAAA CTTAAAGGTA 248940 
TGATTACCGT TAAAGACTTC CAAAAAGCAG AACAAAAACC AAATGCATGT AAAGATGAAT 249000 
TCGGTCGTTT ACGTGTCGGT GCGGCTGTAG GTGCAGGTGC TGGCAACGAA GAACGTATTG 249060 
ATGCCTTAGT GAAAGCAGGC GTAGATGTAC TATTAATTGA TTCTTCTCAC GGTCACTCAG 249120 
AAGGCGTATT ACAACGCGTT CGTGAAACTC GTGCAAAATA TCCAAATTTG CCTATCGTTG 2 4 918 0 
CAGGTAACGT AG C AACGGCT GAAGGCGCAA TTGCATTAGC AGACGCAGGT GCAAGTGCTG 24 9240 
TAAAAGTAGG TATTGGTCCT GGTTCAATTT GTACAACTCG TATTGTAACT GGCGTAGGTG 24 93 00 
TTCCACAAAT TACCGCTATC GCAGATGCAG CTGCTGCATT AAAAGATCGT GGTATTCCTG 24 9360 
TAATTGCTGA CGGTGGTATC CGTTTCTCTG GCGATATTGC AAAAGCTATC GCTGCTGGTG 24 9420 
CAAGCTGTGT AATGGTTGGC TCGATGTTTG CGGGTACAGA AGAGGCtCCA GGTGAAATCG 2 4 9480 
AACTTTATCA AGGTCGTGCA TTTAAATCTT ACCGTGGTAT GGGCTCATTA GGTG CCATGG 24 954 0 
CGAAAGGCTC AT CAGACCGT TATTTCCAAT CGGATAACGC GGCAGATAAA CTCGTACCGG 24 9600 
AAGGTATTGA AGGCCGTATT CCATATAAAG G TTACTT AAA AGAAATCATC CACCAACAAA 24 96 6 0 
TGGGTGG CTT ACGTTCTTGT ATGGGCTTAA CGGGCTGTGC AACTATTGAT GAACTACGCA 24 9720 
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CCAAAGCGGA ATTTGTTCGC ATTAGCGGTG CAGGCATTAA AGAAAGC CAC GTTCACGATG 24 9780 

TTG CTATTAC TAAAGAAGCA CCTAACTATC GTATGGGCTA ATCTTTAAGG TGGGCTTTAG 24 984 0 

CCTGCCTTTT GCTGTTTAAA GTGCGGTTGG tTTTTGCAAA GTTTTAAATA ATATGAACCG 24 9900 

CTTACACTAT CAATAATAAG AGCATTATAA ATGACAAATA TCCACTATCA CAAAATCCTA 24 9960 

ATCCTCGACT TTG GTTCACA ATACACTCAA CTGATTGCGC GTCGTGTGCG TGAAATCGGT 2 5 0020 

GTGTACTGCG AACTTTGGGC TTGGGATGTA ACCGAACAAn AAATCCGTGA ATTTGCTCCC 2 5 008 0 

GAACTTTATC AAGGTCGTGC ATTTAAATCT TACCGTGGTA TGGGCTCATT AGGTGCCATG 2 5014 0 

GCGAAAGGCT CATCAGACCG TTATTTCCAA TCGGATAACG CGGCAGATAA ACTCGTACCG 2 5 0200 

GAAGGTATTG AAGGC CGTAT TCCATAT AAA GGTTACTTAA AAGAAATCAT CCACCAACAA 2 5 0260 

ATGGGTGGCT TACGTTCTTG TATGGGCTTA ACGGGCTGTG CAACTATTGA TGAACTACGC 2 5 0320 

ACCAAAGCGG AATTTGTTCG CATTAGCGGT G CAGGC ATT A AAGAAAGCCA CGTTCACGAT 2503 8 0 

GTTGCTATTA CTAAAGAAGC ACCTAACTAT CGTATGGGCT AATCTTTAAG GTGGG CTTT A 2 50440 

GCCTGCCTTT TGCTGTTTAA AGTGCGGTTG GTTTTTGCAA AGTTTTAAAT AATATGAACC 2 50500 

GCTTACACTA TCAATAATAA GAGCATTATA AATGACAAAT ATCCACTATC ACAAAATCCT 2 50 560 

AATCCTCGAC TTTGGTTCAC AATACACTCA ACTGATTGCG CGTCGTGTGC GTGAAATCGG 2 5 0620 

TGTGTACTGC GAACTTTGGG CTTGGGATGT AACCGAACAA CAAATCCGTG AATTTGCTCC 2 5068 0 

AACGGGTATC ATTCTTTCTG GCAGTCCTGA AAGCACAACT GAAGAAAATA GCCCTCGTGC 25 074 0 

ACCGGAATAT GTATTTAATG CTGGTGTACC AGTATTAGGC ATTTGCTATG GAATGCAAAC 2 5080 0 

GATGGCGATG CAACTTGGCG GCTTAACCGA AACGTCTGAT CACAGAGAGT TCGGCTATGC 2 50 86 0 

TTCCGTTTCT CTTGAAAATT CCACCGCACT TTTTGCAAAT CTTAACGATA ATTTAACCGC 2 50920 

TTCTGAGCCA AAATTAGACG TTTGGATGAG CCACGGCGAT AAGGTAACGC GCTTACCAGA 2 50 98 0 

AAACTTCAAA GTAACTGGCA CAACGCTAAC CTGCCCAATT GCTGCGATGT CTGATGAAAG 2 51040 

CCGTCGTTTC TACGGTGTGC AGTTCCACCC TGAAGTAACG CATACTAAAA AAGG CTTGG A 2 51100 

ATT ATT AAC C AACTTCGTAG TAAACATCTG TGGTTGCGAA AC C AAATGG A CAGCAGAAAA 2 5116 0 

C ATT AT CG AA GATGCCGTTG CTCGCATTAA AGAGCAAGTG GGCGATGATG AAGTGATTTT 2 512 20 

AGGCTTATCT GGTGGTGTGG ACTCTTCTGT GGTTGCATTA CTTTT AC AC C GTGCTATCGG 2 5128 0 

CAAAAACTTA CACTGCGTAT TTGTGGATAA CGGTTTACTC CGCTTACACG AAGGCGACCA 2 513 4 0 

AGTCATGGAA ATGTTTGGCG ATAAATTCGG TTTAAACATC ACTCGTGTTG ATGCTGAAAG 2 514 0 0 
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TCGTTTCTTA GGTGAACTTG CAGGCGTATC TGATCCTGAA GCGAAACGTA AAATTATCGG 2 5146 0 

TAAAGTGTTC GTGGATGTAT T CGATGATG A ATCGAAAAAA CTCACTAATG TGAAATGGTT 2 5152 0 

GGCACAAGGT ACGATTTACC CTGACGTAAT CGAATCTGCA GCAAGCAAAA CTGGTAAAGC 2 5158 0 

ACATGTGATT AAATCACACC ACAACGTAGG TGGCTTACCA GACTATATGA AACTTGGTTT 2 5164 0 

AGTTGAACCT TTACGTGAAT TATTTAAAGA TGAAGTACGT AAAATCGGTT TAGCATTAGG 2 517 00 

CTTACCAGCT GAAATGATCA ACCGCCACCC ATTCCCTGGC CCAGGCTTAG GTGTGCGTGT 2 51760 

ATTAGGCGAA GTGAAAAAAG AATACTGCGA TTTATTACGC CGTGCGGATG CGATCTTTAT 2 51820 

CGAAGAATTA CGCAACAGCG GTTGGTATGA AAAAACCAGC CAAGCCTTCA GCGTATTCTT 2 51880 

ACCAGTGAAA TCTGTAGGTG TGATGGGCGA TGGCCGTAAA TACGATTGGG TTATTTCCCT 25194 0 

ACGCGCGGTA GAAACAATCG ATTTTATGAC CGCACATTGG GCACATTTGC CTTATGATTT 2 52 000 

ATTAGG C AAA GTGTCTAACC GTATCATCAA CGAAGTGAAT GGAATTTCCC GCGTCGTATA 2 52 060 

CGACATCAGC GGAAAACCAC CCGCAACAAT CGAGTGGGAA TAAAAAATCA CTTTTTCAGA 2 5212 0 

TACTTTCTAA TTATTTCAGT AACTTTAATA AGTGCGGTTA ATTTTCACCG CACTTTCTTT 252180 

TTGTTTAAAA GCCCTTCTAT GCCAATCACA ATGAGACTTA ACCAAATAGG CACATAAGTA 25224 0 

AAATAGTCAC TTGTGGTTAT TTGATTATCT AGCACAAAGA CAGCTATTAA AAAAAGCAAT 2 52 3 00 

ATGGGTTCAA TATAACTCAA CACCGCAAAA ATACTAACTG GAATTTGTTG GCTTGATTTT 2 523 60 

AAGTTGGCTG AAAG CGAAAT CGCACTCACA ATACCAAGTG CTGGTAGCAA TAACCAATAA 2 524 2 0 

CGATATTCTT GCGTAACCAA AGAAAGCGTA TCGCTTTGAG AAAGAATATA AATAAAACAA 2 524 8 0 

GGAATCGAAA TTAAAATAAT ATCCAATGTA ATTCCTTGTA GAGCAGGGAT TTT CATC CAT 2 52 54 0 

TTACGG CTT A GATAATAAAA TGGATAAACC GTACAAACCC ACAAGCTTGT CCATGAAAAA 2 52 6 00 

CTTTGAGTGT GCCATAATTC ATGTGCAACA CCGCAGcAGC CAGTAAAAGC GCGATTTTCT 2 5266 0 

GAATAAATGA TAAACGTTCT TTTAACCAAG CCCATCCTAA AACAGCCATA ATCAGCGGAA 2 52 72 0 

AAAGAAAATA TCCCATCGCA AT ATTT AT C C CTTCGCCATT TAGCGGTGCC CACATAAAGA 2 527 80 

GCCAGAATTG ACTTCCCGCA TCTAGTGTCC CCAGTAAAAA TAAAACCCAA CGCGTCCAAC 2 52 84 0 

TTTTTCCTAA AGTTGTCGTA ATAAGAGACA AAAGAGAGCG GCAACCAATA GTCGGAAATA 252 900 

GAATTAAAAG TAACCCAAAG ATCATGGTCA G C ATACGCC A CGCAAAAACA TCCGTTCCGC 2 52 96 0 

TAAGTGGTTG GAG C CAT ATA CTAAATAAAT ACAATATGCC AAATAAACAT TGTGAAAAAA 253 02 0 

TGCACAATAA TATGCCTTGA GTGACTACTT TCATCTTGAA CTTCATTCAT CTCATCAATG 2 53 080 
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TGATTATTAT AAAAGCCCAA GGCAATAAAA AATCTCAACT TTATTTTTTA TTGTGATAAT 253140 
ATTTTTCTCA ATAGTTACAT TTTTGAGAAA TTATCTTATG CAAACACTGG ATAAATTAGA 253200 
TCGTCATATT TTGAATGTAT TACAACAAGA TGCCATGATT CCGTTAAAAG AACTGTCAGA 253260 
GAAAGTGAAT AGTTCCGTTG CAACCTGTCA ACGACGTGTG CAATCATTAA CTGATTCAGG 253320 
CATTATTACA AAACGAGTTG CAGTTGTTTC CCCAAAAGCC GTTGGGCGTA CGATTAGCGT 253380 
ATTTGTCATG GTCGAAATGG ATAATCAACA TTCTTATTAT CAAGAGCAAT TTGAACGGAA 253440 
AATGCGTCAA GAAGACGAAG TGGTGAGCTG TTACGAAATC TCAGGGGACT ATGATTTTAT 253S00 
GTTGCTACTC CATGCAAAAG ATATGGAAAG CTATCACGCA TTCACACGCC GAGTATTGAC 253560 
GGGTGAATTT CATGTAAGAA CTTATAAAAG TTTATTTGTA ATGAATTTTA CCAAAGCAGA 253620 
TAGTGGAATT ATCTTATAGA ACAAGTGCGG TAAATTTTTA CCGCACTTTG TGTATGATTA 253680 
GTTTAGTTTT GTCGTTTGTT TTAAGAATAA ATACCCAAGT ATTG CCGAAA CCGTTGATCC 253740 
AAGTAGAATA CCTAAACGAG AAAGAGTATT GACACTTTCC CCCGCATTAG CATCAAACGC 253800 
AAGGCTTGCT AAGAACATTG ACATCGTAAA ACCGATTCCA C AT AAAAC CG CAACAG C AAA 253860 
AATTTGTTTA AAATTAATTC CATCCGGTnA TTTAGCTAAT CCCAATTTAA CAGAAATATA 253920 
ACTAAATCCA AATATACCGA CTGGTTTACC AATAATTaAA CCGCTTGCTA TAGCCAATAA 253980 
TAATGGCGAC GAAATCATAT TCACATCTAT GCCTGCAAAA CTCACACCAG CATTGGCAAA 254040 
TG CG r AAAGT GGTAAAATAA CGAAAGATGA CCAAGAGGCT rAAATATGCT CGAAATCATC 254100 
TAATGGG CGT TCTCCTTTTT TACCTTTCAA TGGAATG CTA AAACCAATAA TAACGCCTGC 254160 
TAAAGTAGCA TGCACACCAG ACTTTAATAC GGAAGCCCAT AAAATTGCAC CTACG AC CAT 254220 
ATAAGCGCAA AGTGCAGAAA CTCTGAAACG ATTTAGTAAT ATCAGcACAA TAATTGCCAC 254280 
GGCAGAAAAA ATTAGTGCTT GCACACTCAA TCCGTGCGAG AAAAATAATG CAATAACAAC 254340 
TATCGCACCG AGGTCATCAA TAATAGCAAG GGCAAGTAAA AATATTTTCA GTGGAAGTGG 254400 
CACTTGTTTA CTCAATAACG CCATAATTCC CAGTGCAAAA GCAATGTCTG TTGCCATTGG 254460 
AATTGCCCAG CCATTAGCTA AACTTGGATC TTGTTTAGCG ATAAACCAAT AAACTACAGC 254520 
CGGTATAACC ATTCCGCCAA TAGCTGCAAT AG CAGGG AAA AT AG CTTGTT GATAGGTAGA 2 54 580 
AAGCGCGCCT TCAAATAATT CTTTTTTGAC TTCCATTCCC ACTAATACAA AAAACACCGC 254640 
CATAAAACCA TCGTTAATCC AGTGAATTAA GGTTTTATTA ATTGAGAAAC TAC CAATTTG 254700 
TAAACTTACA GGCAAATTTA AAAAATCATT ATATTGGTTG CTTAACGGCG AATTAG CCAG 254760 
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CAACATCGCA 


ACAACAGCTG 


AAAAAAGCAA 


TAATATACCG 


CCAGCAGATT 


CTAATTTAAA 


254820 


AAAACGTTGG 


ATCAGTTTTA 


TCACATAAAC 


CCCTTTAAAA 


ATACATAGCA 


AAAAATTCAC 


254880 


AAAGGAAGCA 


TCCTAGCATA 


TTTTCGAAGT 


TAAAGGAAAA 


AGTGCGGTAG 


ATTTTGTCGG 


254940 


GATTTTTAGC 


GAATTGTCGA 


TAAAATGCCA 


TATATTGTGA 


CAAAATTCAC 


ATTTTTAATT 


255000 


TTACACAATA 


GAGATTTTAT 


CTACTTTGTA 


GTAGAATGCG 


GACCTTTTAT 


TTTTAGAATC 


255060 


AAAAACTATG 


TTTTCACGCA 


AAGATATTAT 


CGTTTTAGGA 


ATGATGATCT 


TCG CCTTATT 


255120 


TTTAGGTGCG 


GGTAATATCA 


TTTTTCCGCC 


TATGGAAGGA 


TTTTCCTCAG 


GACAACATTG 


255180 


GACGAGTGCT 


TCCTTAGGCT 


TTGTTTTAAC 


TGGCGTTTTA 


ATGCCATTTA 


TTACGCTTGT 


255240 


GGTGGTAGCG 


ATTTTAGGTC 


GTGGCGAAGA 


ACTTACTAAA 


GATTTACCAA 


AATGGGCTGG 


255300 


CACAGGTTTC 


CTCGTTATAC 


TTTATTTAAC 


CATTGGCTCT 


ACCTTTGCAA 


TGCCCCGCAT 


255360 


CACTAATGTT 


GCTTATGAAA 


TGGCGTGGTT 


GCCTTTAGGC 


TTAACAGAAA 


ATAATG CGAA 


255420 


TGTTCGTTTT 


GTATTCTCAC 


TCATTTTTAA 


CTTAATCGCG 


ATGGG CTTT A 


TGATTAGCCC 


255480 


AAATACGATC 


ATTTCAAGCG 


TGGGTAAATT 


TATGACACCA 


G CTTTATT AG 


TGTTACTTAT 


255540 


TGCTGTAGCA 


ATTACCGTAT 


TTATTTCGCC 


TTTATCTGAA 


ATTCAAGCGC 


CAAGTAATGC 


255600 


TTATGAAAAT 


AGCCATTCTT 


TATTAATCGG 


TTTAACTAGC 


GGCTATCAAA 


CGATGGATGT 


255660 


ACTTGCAGCC 


ATCGCTTTCG 


GTGGAATTGT 


CGCCCGAGCA 


TTATCAGCGA 


AAAATGTGAC 


255720 


AAAAACCAAA 


GATATTGTGA 


AAT ACAC CAT 


TTCAG CAGGA 


TTTGTTTCAG 


TCATCTTACT 


255780 


TGCGGGTTTA 


TATTTTTCGC 


TTTTCTATCT 


AGGCGCAACT 


AGTGCTGCAG 


TTGCAGAAGG 


255840 


GGCAACGAAT 


GGTGGACAAA 


TTTTCTCTCG 


TTATGTAAAT 


GTCTTATTCG 


GTTCAGCTGG 


255900 


GACTTGGATT 


ATGGCAGGTA 


TTATTGTGTT 


AGCGAGCTTA 


ACAACACTTG 


TTGGTGTAAC 


255960 


TAGCGCATCT 


GCTGATTATT 


TTTCAAAATT 


CTCTGTGCGT 


TTTTCTTATC 


CATTCTGGGC 


256020 


CGCACTTTTT 


ACCGCAATGA 


CAATCACTGT 


TTCACAATAT 


GGTTTAACGG 


ATCTACTTCG 


256080 


CATAACGATC 


CCAGCCTTGT 


TATTAATTTA 


CCCAGTAGCC 


ATTGTGCTTG 


TTTTATTGCA 


256140 


ATTCTTACGA 


AAAAAACTTC 


CAT C AAT AAA 


ATTTACATAC 


AATAGTACAT 


TGCTCGTTAC 


256200 


AC^TTTtT'n "i "r 

<*>J 1 1 1 X. A. 


Ao I 1 lAlolu 


ALAbL 1 1 AAA 


TAACGTTAAA 


ATGTTACCAG 


AAAGCATCAA 


256260 


TTCATTATTG 


AAACACTTCC 


CTCTTTCATC 


TGAAGGAATG 


GCTTGGTTGG 


TTCCGACTTT 


256320 


AGTAATGTTA 


GTTGCAAGCA 


TTTTTATTGG 


AAAAGCATTG 


CATAAAACCC 


ATTCTTAATC 


256380 


TCACAAAGTG 


TTTTATTTAT 


CAGCCCTGTG 


CCGTAAAACT 


CCGTCCTTCA 


GGGCGGAGAT 


256440 
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ATAAGGCACA GGG CTGATAA ATTTCTAACC GCACTCTTTC AAAAATCAAA TTAATTTAAC 2 56500 
AGGTACTTTT ACGACTAATT TTTTAATTTT TTCTGTTTTT CCATTCACTA CACAGCAAGG 256560 
CTTGTGCGGT TCATAAGGAT AGAATACAGC GAACATTTTT GGTTTCATTG TGACGGTGAA 2 56620 
TTTATCATCA ATATCTGCAC AAAGTTGATA ATCATCCGCT TCGTTGTAAT CTTCATACTT 256680 
AGATAAATTC GGATAGGTTG CACCCACTTC GATATTTTCC GTGCCACGAA TAAG C ACTTG 2 5674 0 
TACATCTAAG TATTCATGAT GTAATTCTGC TTTTTTACTG CTTGGTTCTG CAGTTTCTGG 2 56800 
TTCCATTACG TTCATATAAA TTTGATCGTT AATATCATGG CGACCATTCT CTAACGCATT 256860 
CAAATCTAAC GTATTTAAAT AATCACATAC TTCAGCAATC ACTTTTGGTA AAC CAACTTT 2 5692 0 
AAAATTTGGG TTGGTTAAAC TGCTAATAAT CATAGAATTT TCCTTATAGT GGATAAAAGT 2 56980 
TACACTATGA T AC C AAGTC A GTTAAATGAA ATAAAGCCAC CGATTACATC ATCTCTCTTG 2 57040 
ATTAATCCAA TTAGCTCTTG TAGTACCGAG TGGAAGAGAG CAAATATCTT GCCAACTTAT 2 5 7100 
TGACGCATAA AGTCTAAACA ATGAAATTAC GGCATTAACA CAAGCATTTT CAGAAAAAAT 2 S 716 0 
ATTATCAAGA GGCTTATGAG GACCTGTAAA ATGAAAAATA TTAGGAGAAG TATCAAGAGT 257220 
ATTAATTGAA ATCATATGAA CTAATTGTAA TGTTGAATAG CGTTTAGCAA AATAAAGTTG 2 57280 
AAGATCATTT TGCCACGATT CAGAAATATC TCCTGTATGT AAAATAAAAA CAGAGAGATT 2 57340 
TTCATTGTGC ATTAATAAAG ATTTCAGTGT AATTTCTACA TTAAGGAAGA TAATTAGCAT 2 57400 
CACTAGCCAT CATTACATTA TATTTCATAT AAAAGAACCT AATTTCAGAG AGrACAATCA 2 57460 
ATTTTTAGGA GGAAAATACT GTCGAAACAA TGATAATTAA GTTGGTAATG TGAAATAAAG 2 57S2 0 
TTAGCCATTA AAGCCAAACA TCCTGTATAA TGCACGGGCT TTATTTTCTA CATT AG CGTG 2 57580 
CGG CTATCAA AAGAAATTTT TAAACGCCCA AAGTGCGGTC ATTTTTTCGA AAGATTTCTT 257640 
TTGCTAGTCG CAATCTGGGA AATAAGGCAT TGTTTTTAAT ATAAATTTTA AGGAAAACAT 257700 
TGTGAATCCA ATTGTTAAAC AATTTAAATA CGGTCAGCAT ACCG TTACTC TAGAAACTGG 257760 
TGCGATTGCA CGTCAAGCAA CGGCTGCGGT TATGGCAAGC ATGGACGATA CCACGGTATT 2 5782 0 
CGTGACTGTT GTTG CTAAAA AAGATGTGAA AGAAGGTCAA GACTTCTTCC CATTAACCGT 2 5788 0 
AAACTACCAA GAGCGTACTT ATGCGGCGGG TAAAATCCCG GGTGGTTTCT TCAAACGTGA 2 57 94 0 
AGGTCGTCCA TCTGAAGGCG AAACTTTAAT TGCACGTTTA ATTGACCGTC CAATTCGTCC 258000 
ATTATTCCCA GAAGGTTTCT TCAATGAAAT CCAAGTTGTG GCAACTGTGG TTTCTGTAAA 2 58 06 0 
CCCACAAATC AGCCCAGACT TAGTGGCAAT GATCGGTGCT TCTGCAGCAT TAACCTTATC 2 58120 
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TGGTGTGCCT TTCAACGGCC CTATCGGTGC AG CGCGTGTT GGTTTTATCG ACAAC C AATT 258180 
CGTATTAAAC CCAACTATGG CAGAACAAAA ACAAAGCCGT TTGGACTTAG TGGTTGCAGG 258240 
CACTGACAAA GCCGTATTAA TGGTTGAATC TGAAGCTGAT ATTTTAACTG AAGAACAAAT 258300 
GTTAGCAGCG GTCGTATTCG GTCATCAACA ACAACAAGTT GTAGTTGAAG CAATCAAAGA 258360 
ATTTGCAAAA GAAGCGGGCA AACCACGTTG GGATTGGGTT GCACCACAAC CAAACACAGA 258420 
TTTAATCAAC AAAGTAAAAG CGATTGCTGA AGCACGTTTA GGCGATGCAT ACCGCATTAC 25S480 
TGAAAAACAA TTACGTTACG AACAGATCGA TGCGATTAAA GCGGATGTAA yTGCGCAAAT 258540 
CACAGCTGAA GATGAAGAAA yCAGCGAAGG TAAAATCGTG GATATTTTCA CCG CACTTGA 258600 
AAGCCAAATC GTTCGTGGTC GTATCATTGC AGGCGAACCA CGTATTGACG GTCGTACCGT 2S8660 
TGATACTGTT CGTGCATTAG ATATTTGCAC TGGTGTATTA CCACGTACTC ACGGTTCTGC 258720 
GATTTTTACT CGTGGCGAAA CGCAAGCATT AGCTGTTGCA ACCTTAGGTA CCGAACGTGA 258780 
TGCACAAATC ATTGATGAAT TAACGGGTGA ACGTCAAGAT CACTTCTTAT TCCATTATAA 258840 
TTTCCCTCCA TATTCTGTAG GCGAAACAGG TATGATCGGC TCGCCAAAAC GTCGTGAGAT 258900 
CGGTCACGGT CGTTTAGCAA AACGTGGTGT AGCGGCTGTT ATGCCAAGCC TTGCAGAATT 258960 
CCCGTATGTT GTGCGTGTAG TATCTGAAAT CACTGAATCT AACGGTTCTT CTTCTATGGC 259020 
GTCTGTATGT GGTGCGTCTT TAGCATTAAT GGATGCGGGC GTTCCAATCA AAGCTGCTGT 259080 
TGCAGGTATC GCAATGGGCT TAGTGAAAGA AGAAGAAAAA TTTGTGGTGC TTTCAGATAT 259140 
TTTAGGCGAT GAAGACCATT TAGGCGATAT GGACTTTAAA GTGGCTGGTA CACGTGAAGG 259200 
TGTAACCGCC CTTCAAATGG ACATCAAAAT TGAAGGTATC ACCCCTGAAA TTATGCAAAT 259260 
TGCATTAAAC CAAGCGAAAA GTGCACGTAT GCACATTCTT GGTGTAATGG AACAAGCAAT 259320 
TCCTGCTCCA CGTGCTGATA TTTCTGACTA CGCACCGCGC ATCTATACAA TGAAAATCGA 259380 
TCCTAAGAAA ATCAAAGATG TGATTGGTAA AGGTGGTGCA ACAATCCGTT CACTAACTGA 2 5 9440 
AGAAACTGGT ACATCAATTG ACATTGATGA CGATGGTACA GTGAAAATTG CCGCAGTAGA 259500 
C AG CAATG C A GCGAAAAACG TAATGGGACG TATTGAAGAA ATCGTTGCAG AAGTTGAAGC 259560 
TGGCGTAATT TACAAAGGTA AAGTCACTCG TCTTGCTGAT TTTGGTGCAT TCGTCGCAAT 259620 
CGTTGGAAAT AAAGAAGGTT TAGTTCATAT TTCTCAAATT GCTGAAGAGC GTGTAGAAAA 259680 
AGTTAGCGAT TACCTTCAAG TGGGTCAAGA AGTAAACGTA AAAGTTGTTG AAATCGATCG 259740 
TCAAGGTCGT ATTCGTTTAA CAATGAAAGA CTTGGCTCCA AAACAAGAAA CTGAAATTAA 259800 
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CCAAGAAGAT TCTGTCGAAG AACAGGAATA ATCTTGGAAT ATAGAATCAA AACTTGGTAG 2 5 9860 

CATTCGCTGC CAAGTTTTGT TGTTTTCCAA CTTTTAGGCA TAAAGAAAGC AAATG CGGTG 2 59920 

TTTTCGCTTA TCTCGCCATT TTATAGTCTA TTTATTTAGC TTATGTGCGA TCTTACTCCT 2 5 9980 

TGCTGGCTGT GTTCAGTCTA GGGGCGGTTT TGTGTCTAAA AATCATGTCG TGTTGGCTGA 2 6 004 0 

ACAAAACCCG AATACT C ACT TTGAACAAGA AGTGATGATC GTGCGATTAA GCCAAGTACT 2 6 0100 

ACTGGTTGGA AAAATGAGTA ATGAAGAACG CGCATCGCTA CATTTTGAAC GTGGCGTACT 2 6 016 0 

CTATGATTCT CTTGGATTAT GGGGACTTGC TCGTTATGAT CTTACTCAAG CCCTTGCATT 2 6 0220 

ACAGCCGAAA ATGGCATCTG TGTACAATTA CTTAGGGCTT TATTTATTGC TTGAAGAAGA 2 602 80 

TTACGACGGC GCATTGGATG CTTTCAATAC GGTGTTTGAA TTAGATTCAG GTTATGACTA 2 60240 

TACCCACCTT AATCGTGGTT TAAATTTTTA TTATGTAGGG CGCTACCATC TTGCTCAGCA 2 6 0400 

AGATTTCTTA CAGTTCTACC AAGCCGATAA AAAAGATCCC TATCGTGTCT TATGG C TATA 2 6 0460 

TTTGAACGAA CAAAAATTAA AACCACAAGA AGCCCAAACA AACCTTGTTG AACGAGCAAA 2 6 0520 

AGGGCTATCA GAGGACTTCT GGGGTACACA TATCGTTCAA TATTACTTAG GACACATCTC 2 60580 

TGTAGAAGAA TTGCAACAAC GTGCAAGTGA ATTTGCCGAA AATTCACAAC AATATG CTG A 2 60640 

AATTCTCACA GAAAC CTATT TTTATCTAGC AAAACAAAAA CTCAATGTAG GGCTAGTAGA 26 0700 

TGAAGCAG C A GCTTTATTTA AACTAGCTAT GGCAAATCAG GTTTATAACT TTGTTGAGTA 2 6 0760 

TCGTTTTGCC GCGTTTGAAC TCATGAAATT AAAACCGGTT CAAACAGAAG ACGAAAAAGA 26 082 0 

AGAGAAAAGT GCGGTAACAA AAGCGATTGT TTTCTAAAAA TCAATCCAAA ACTTACCGCT 26 08 80 

CTTTG C AG T A GAACGAATAG AAAAATT CAT TCGCTCTGCT CAACTTAATA AACCGAAGGC 26 0 94 0 

CTCTTTGAGC TTTCCTTATT TATATTCGAG TATTTTAATG ACTGACAAAA TTACTTTCAA 261000 

TGACTTAGGC TTACCTGAGT TCATCTTAAA AGCTGTTTCT GACTTAGGTT TTGAAACACC 2 6106 0 

TTCTCCAATT CAACAATCTT GTATCCCGCA TTTG CTCAAT GGCAATGATG TGCTAGGTAT 2 6112 0 

GGCACAAACA GGTAGTGGTA AAACCGCTGC CTTTGCACTG CCTTTACTCG CGCAAATTGA 2 6118 0 

TCCAAGTGAA AAACATCCAC AAATGTTAGT AATGGCACCT ACACGCGAAC TTGCTATTCA 2612 4 0 

AGTTGCCGAT GCGTGCGAAC TATTTGTTAA AT ATG CAC AG GGCACTCGCA TTGTTACCTT 2 613 0 0 

ATATGGTGGC CAACGTTATG ACATCCAACT TCGTGCATTA AAACAAGGCG CACAAGTGGT 26136 0 

TGTGGGTACA CCAGGCCGTA TTCTTGACCA TATTCGTCGC GGAACATTAA ATCTTTCTGA 2 614 2 0 

ACTTCGTTTT ATCGTATTAG ATGAAGCCGA TG AAATG CT A CGCATGGGTT TCATTGATGA 261480 
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TGTAG AAAC C GTCATGGCAG AATTACCAGA AAATCATCAA ACTGCGTTAT TCTCTGCAAC 261540 
AATGCCTGAA CCGATCCGTC GTATTACGAA ACGTTTTATG AATGATCCAC AAGAAGTGAA 2616 00 
GATCAAAGTG AATAACGAAA ATGCCCCAGA TATCGACCAA AGTTGTTGGT ATGTACACGG 261660 
CGTGCGTAAA AACGAAGCCC TTCTTCGTTT CTTAGAAGTG GAAGATTTTG ATGCAGCGAT 261720 
TATCTTTGCT CGCACTAAAA CTGGTACTTT AGACATTACA GAACTACTTG AAAAAAATGG 261780 
TTTCCGTTCT GCCGCACTTA ATGGTGATAT GACACAACAA TTACG CGAAC AAACCCTCGA 26l840 
TCG CTTACGT AACGGCAGTT TAGATATTGT TGTGGCAACC GATGTGGCTG CACGCGGTAT 261900 
TGATATTGAA CGTATCAGCC TTGTGGTGAA CTACGACATT CCGCTTGATG CAGAATCTTA 261960 
TGTACACCGA ATTGGGCGTA CTGGTCGTGC AGGTCGTTCT GGCCGTGCTT TATTATTTGT 262020 
TGAACCTCGC GAGCGTCGTT TACTTCGTAA TATTGAACAC CTAATGAAAA AAGGTATCAA 262080 
TGAAGTAGAA TTGCCAAATC ATCTTGTGTT ACAAGAATGC CGCCGCAAAA AATTTGTCGC 2621 40 
CAAAATTACG AAACAACTCG AGCATCACGA TTTAGAACAA TATCGTAGTT TATTAGAAGA 262200 . 
CTTATTCACA GCCGATCAGG ATCAAGAAAA TATTGCTGCA GCAATGTTGA TGCTGCTTCA 262260 
AGGCAAACAA AAACTCATTC TTCCACCTGA TCCACCAATG GAAAAACGTC GTCGTGAACG 262 320 
TAATGATCGC GGTGATCGCC GTGAAAATCC ACGTTCAGCT GAACGTCGTG G CGAACGTAA 262380 
AGG CTATGG A AATCCACAAC CAATGGATTT ATATCGTATC GAAGTCGGTC GTGCAGATGG 262440 
TGTAGAAGTT CGCCATATCG TTGGTGCGAT TGCTAACGAA GGCGATATTA ACAGCCGTTA 2625 00 
TATTGGTCAT ATTAAACTTT ATGATGATTA CACTACCGTT GAATTGCCAC AAGGTATGCC 2625 60 
GAAAGAATTA TTGCAACAAT TCGCTAAAAC ACGCGTATTG AATAAACAAA TGCAAATGTC 262620 
TTTCTTAGGT GCAGTAAAAT CCGATAATTC CCGTGGCAGC GATGATTTTA ACGGTAAACG 262680 
TAAAGGTCGT GGTGGCGATT TCCGTGGAGA GCGCGGACGT GAACGCGGCA ATGATAATCG 262740 
TGGAAATCGT AAATTTAATG AAAAAAGTAA CCGCACTTTT AGCGACAAAC CACGCCGTGA 262800 
TCGTAGATCA TCATTCTAAA AATAAATCGT CTGATATTTT ACATACACTT AGCCCCTTGA 26 , 860 
ACCTCAAGGG GGCTTTTATT GTCTATAGTG AAAGTTTTGG CTACAATCCC ATAACTTTTT 262920 
TATATAGGAA AAATCATTGA AAGGTCTTTT TTTACGTATT ATTACTGCTC TAGCGTTATT 262980 
ATTTTGGG CT ATTGATATGG TGTTCCCATG GCAATTTTTA CGTCATACTG AAGAAAATCA 26 3040 
TTATACTG CA ATTCAAGCTA GGGGATCACT TTATGTAGGA ACAATAAATA ACCAGATTTC .63X00 
TTATTTTATT AATAAAGATA GCGAACGAGG CTTTGAATAT GAATTAGCCA AAG CATTTG C 2S3160 
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TGATACTTTA GGCGTAGAAC TCGAAATGAA AATTTTCGAT AATCAAGAGC AGTTATTTGA 263220 
TGAACTTGAT AAACACAATA TTGATTTAGC TGCCGCCCAC CTTTTATACC ATCCTAAAAA 263280 
TGCGGAACGT TTCCAAATAG GTCCTGCATA TCATTCTGCT TCATGGCAAT TAGCTTATCG 263340 
AAAAAATGAA AATCGTCCTA AAAATTTAGG TAACGTAAAG AAAGATATTT ATATCTCAAA 263400 
TAATTTAGCA TTAGAAGAAA CGCTAAAAGA ATTACAAAAA CAATATCCAC AATTAATTTG 263460 
GAAAAGAAAT CAAGCACTTA CTCAAGAAGA ACTATTACTA CAACTGGCTG AAGGCAAAAT 263520 
TCCTTACGTT ATTGGAAATT CTATTGATAT TGCTGCAATT CAACAAATCA AACCAGAGTT 263S80 
AGCCATTGCC TTTGATATTA CTGATGAAGC AAACGTTCAC TGGTATTTAC CCAATAACCC 263640 
CTACCGCGAT CTACAAACGG CTTTACTCAA TTTTATGAAT AATGCCGAAG AAACGGGACT 263 700 
TTTAGATAAT TTAAAAGAAA AATATTTAGG ACATATTTCA CAATTTGATT ACGTGGATAC 26 3 76 0 
GCGCTCTTAT ATGAACGCAA TTGAAAATAC CCTGCCACAA TTCTCACCGC TCTTTGAAAA 263820 
ATACAAAGGT GAATTAGATT GGCGACTACT CGCAGCAGTG GCTTATCAAG AATCTCATTG 2638 80 
GAATCCTGAC GCAACATCGC CAACTGGAGT ACGTGGCATA ATGATGCTCA CCAAAAATAC 26 394 0 
CGCACAACAT ATGAAAATCA GTGATAGAAC AGATCCTGAA CAAAGTATTA AAGCTGGTTC 264000 
AGAGTATTTA CACTGGTTGA TTAGTCAGCT TCCTGAAAGC ATTGAAAAGG AAGAAAGAAT 264060 
TTGGTTTGCC TTAGTTGCCT ATAACATAGG ACTCGGTCAC TTAATTGATG CACGCCGTTT 264120 
AACTCAAAAT CTTGGAGGAA ATCCAGATAA TTGGTTCGAT GTGAAAAAAA ATCTG CCTTT 26418 0 
ATTAGCAGAA AAACG CTATT ATTCCCAACT GAAATACGGC TATGCGCGAG GCTATGAAGC 264240 
CCATCAATAT GTAGAAAATA TTCGCCGCTA TATGAATAGC ATAGTGAATT ATCATCGAGT 264300 
CCAAGAAAAT CAAACAACGA ATGATAATGC CAACAACGAA AGTGCGGTTA AAAATTTAGA 26436 0 
AGAAATAAAA GAAAATAAGG ATTAATAAAA TGTTAAAACG TAAAACTTTT CTTAAAAAGA 2 64420 
AAGCTAATTT ACAACAAAGT TCTTTATCAC GTAATCTACG TTTAAGACGT TTAAAACACC 2 644 80 
GCAAACAACA ACAATTTTCG AG AC ATT CAT TACAATTTGT CGTACAAGAA ATTTGTTGCT 2 64 54 0 
AAGTAATCTA TTTTTCTTAT CAAAATAAAG AAAAAATAAC CGCACTTTTT TATGTAATTT 2 646 00 
ATATCATTAA AGTGCGATCT TTGTTTTACA ATAATCGTAA AATATATTGT GAAATTTTAT 2 646 60 
AGATTAATCA ATATTTTATT CTAATTTTAT CCCTAAGTTT CCAATTTTCA CACCGACTAG 264 720 
ACCAATCGAG ACGGGATCAA AATAATC C CC TAAAAATTTG AACAATTCAT TTAACTCTGT 2647 80 
AAAAGAACGC TGAATTTTTA TTTGATCCTC CACTTTACGC ACAAATTCAT AGCGTATGTT 26484 0 
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ATTTTCCTTC AAAATAAGCT GTTAGAGTTA AATAAATCGT TTCAAAGAAA TGGCTTTTTG 264 900 
CCCCCTCCTC TCCGGGATCA CTAATACGCA CATTCCACGT ATTATTAAGT AGAATTTGAG 264 960 
GCTGTTTTGA CATATTTTAC TCCTTGCTAT TTTTAACCTA AGAAAAACCC GATTATAAAG 26 502 0 
TTTCTGTAAT CAAAAAAATT TATAATCGGG CTTTTCAAAT ATTTTCCCTG GGTGTTTCGC 26S080 
CTTACGTTGG TAACTGCCTT TCCCTTTACG TTTTTTCTCA ACACGCTGAC GAAATAATTT 2 6 5140 
ATCGCCTAGC AATGCCATCA CTGCATTATC TTTTATCACG CCTCGAGTGT GCAAATAAAC 26 3200 
GGTTTCATTT TCAACCGCAC TTTTTTGTTT CTTTCTCATA AATTTTCCTT AAAAAAGCCG 265260 
CGGAAGGATT TCCACGGCTG GTGTATTATA CATG CAAACG TTTAAATTAC AAAATCTCAA 26 5320 
GTACAGTTTC TGGTGGACGC CCGATTTTAG CTTTATCGCC ATTAATAACA ATTGGGCGTT 2653 80 
CAATAAGTGC TGAATGTTCA CTTATCGCTT TAAATAATTC TGCTTGAGAA AGATCTAAAT 26 5440 
TATCTAAATT TAAGCTTTTA TATAGTTCAT CTTTCGTGCG CATCATTTGG CGAACATCAT 26 5500 
CAATTCCCAA TTTTTTGGCA ATGCTTTGTA ATTCATTAAC GGAATACTGC TTTTG C AAAT 265560 
ACAGTTCAAT AATCGGCTGA AT AC CTTTAT TTTCTAATAA TGCTAGCGTT TCACGGCTTT 265620 
TTGAGCAATG TGGGTTGTGA TAAATAATAA CAGACATAGG ATTTCCTCTT AGTAGCTATA 265680 
AACACTGTAA ATTTTAACTT ATAATGGCAA AAAAATACTT ATAGGAAAAG AAATTATGCT 26 574 0 
AGAAATGTTA AAAAGCTGGT ACTCACGCCG TTTAAGTGAT CCTCAAGCGA TGGGACTATT 265800 
AGCTATATTA TTATTTGGCT TTATCTCCAT TTATTTTTTT GGAGATTTAA TTGCGCCTTT 26586 0 
GTTGATTGCT CTAGTACTAT CCTATTTACT TGAAATACCA ATCAATTTTT TAAATCAATA 26592 0 
TTTAAAATGC CCAAGAATGT TAG CG AC AAT TCTTATTTTC GGTAGTTTTA TTGGTCTAGC 265980 
AGCAGTTTTC TTTTTGGTAT TGGTGCCAAT GTTGTGGAAT CAAACTATTT CCTTATTAAG 26 604 0 
TGATTTGCCC GCTATGTTTA ATAAATCTAA TGAATGGCTA CTCAATTTAC CTAAAAATTA 2 66100 
TCCTGAACTT ATTGATTATT CAATGGTAGA TTCTATTTTC AATTCTGTGC GAGAAAAAAT 266160 
CCTTGGTTTT GGCGAATCTG CTGTAAAACT TTCTTTAGCC TCTATTATGA ACCTGGTTTC 2 6622 0 
TTTAGGGATT TATGCCTTTT TAGTGCCATT AATGATGTTT TTTATGTTGA AAGATAAATC 266280 
AGAACTTTTA CAAGGCGTTA GCCGTTTCTT ACCGAAAAAT AGAAATCTCG CATTTAAAGA 266 34 0 
TGGAAAGAAA TGCAACAACA AATTTCTAAT TACATTCACG GAAAATTATT AGAAATACTG 266400 
ATTGTCACTT TAATCACTTA TATTATTTTC TTAATTTTCG GGCTAAATTA TCCGCTCTTA 266460 
TTGGCTTTTG CGGTGGGTCT TTCAGTTCTC GTCCCTTATA TTGGCGCAGT TATTGTCACG 266 52 0 
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ATTCCTGTTG CATTGGTTGC TTTATTCCAG TTTGGAATCA GCCCAACATT TTGGTACATC 266580 
ATTATTGCT T TTGCTGTAAG CCAACTTTTA GATGGAAATC TCCTTGTGCC TTATTTATTT 2 66640 
TCTGAAGCCG TCAATTTACA TCCACTAATT ATAATCATTT CAGTATTAAT TTTTGGCGGT 26S70 0 
TTATGGGGAT TTTGGGGTGT ATTCTTTGCC ATTCCTTTAG CCACTTTAGT AAAGGCCGTA 266760 
ATAAATGCCT TACCTCAAGA TTAAATAAT C TTTGAATTTT AAGTAGTCTA TTTCAATTAG 266820 
AGCTTTATAT AAATAAAAAA TAGTTAATCG TAAATTTTCG ATTAACTATT TTTTTAATTT 266880 
ATTCTAAAAG TAAATTAACT AAGGTAATTC ATCAATTTCT TTATCTACTT TAGGTGGGAT 266940 
CATATGTTCA CGACGTAAAC CAACATCGTA TGCAATGGCA ATGGCAACAA AAATCGAGGA 267000 
ATAAGTACCA AATCCAATAC CTACGAGTAA AGCAAGTGAA AAGTTATGAA TGGAAGGACC 26 7060 
ACCAAAGAAG AACAATGCCA TCACGACAAC TAATGTAGTA ACCGAAGTAA TGATAGTTCT 267X20 
TGATAAAGTT TGCGTTAAAG AAATATCAAT AATATCAATC GTATC CAATC GTCTAATTTT 267X80 
TCGGAAATTT TCACGAACCC GGTCAAATAC CACAATACTA TCGTTGATGG AGTAACCCAC 267240 
CACAGATAAA ATCGCTGCGA CAAAAGTAAG ATCAATTTCA ATTTGTAATG CAGAGAATAC 267300 
CCCTAGCGTA ATAATGACGT CGTGCGCAAG AGAAGCGATA CTGCCAAAGC CTAAACGCCA 267360 
TTCAAAACGT GACCCCACAT AAATAAGCAC CATTGCTAAT GTCGCTAAAG TCGCATATAC 267420 
CGCACCTTGT GCTAATTCTT CACCAACATT TGGGCCAACG AACTCAATAC TGCGAATTTG 267480 
AATGTCTTTA TCTACATTCT GTAGCATACT TTTGACGTGT TCACCAATGG TAGAATCATT 267540 
ATTACTTGCA GGTAAACGAA TCATCACATC CTGAACCGAT CCTGTGGTTT GTACAATTGG 267600 
GCTTTCAATT CCATTTTCGT GAAGTTTACT ACGAATTTGT TCAAGGTTAG CGGACTGCGA 267660 
GAAGTGAGTA TCAAATACCA CTCCACCAGT AAAATCTAAG CCCCAGTTAA ATCCTTTGGT 267720 
AATAATAAAA AATAGAGAAA TTACCATCAA AAGTGCGGAT AATATATAAC CCAATTTACG 267780 
CACTTTCATA AATTCAGTCA ATGGGAACGG GAGCTTTATC CCATTGATTT CACGGATAAA 267840 
ATGTCCGTCT TTATCTTTTG TAAAAAGTTT CATCATTTCC CGCCTAAATT AATAATTTTT 267900 
TAAGTTGTTT ACCACCGTAA AGGGCATTAA CTAATGCGCG AGTTCCTGTA ATCGCGGTAA 26 7 960 
ACATAGAAAT CGCAACACCA AGTGAAAGCG TAATCGCAAA ACCTTGAATT GGGCCTGTTC 268020 
CTACCGCGTA TAGAATAATT G CGGTT AAG A TTGTGGTTAA GTTTGCATCA AAAATAGAAG 268080 
TAAATGCGCC GTTATAACCT TCATTAATGG CTTGCTGAAT TGAACGACCA TTACGAATTT 268140 
CTTCTTTAAT ACGTTCAAAA ATCAATACAT TCGCATCTAC TGACATACCT AAAGTTAAAA 26 8200 
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TTAACGCAAT CACGATGCTA TCGCTACCTT CGGTTACTTT AGTAATTTTG CCGATTACAC 269940 
CACCAGCGGT TAAAACTTCA GTACCTTTTG CAAGCTCAGA CATCAATTTT TTGTGTTCTT 270000 
TATTGCGTTT AGCTTGCGGG CGATAAATCA TAAAGTAGAA AATTAAACCA AAGATCACGA 270060 
AAATAAATAG CGTGGACATT GGGCTTTGTG CTTCCATTGT GTTTTCCTTA TTAATTTAAG 2 7012 0 
TTAAAAATTT TGCGTAAAAT ACCATAAATT GCTTCATTGG GAAAGGCGAA CTAAGCCCAA 270180 
TATCCTACGT TATGGCGATT AATTTTATTT TTTCAAGATA ACGATAAATT CTTTTTCCGA 270240 
AGCGTAATAT TG CTCGACCA AAGCAACAGA ATTTTCTTCG GCAAAAATAG AAAAATTTTC 270300 
CACCGCACTT TCAAGGTTTA AGATCAACAT TAGCTCATCA TTTTTATCCA AATTTTTTAA 2 7036 0 
GGCTTTTTTA GCACTTAAAA GAGGAATGGG GCATCGAAGT GCGGTTAAAT TGAGCTGATA 270420 
TTTCATTCTT TAACTTGACG ACGAGCACTC ATGTGCTTGC CCATTTCAGT CAAAATATCC 270480 
GCACTTAAGC ATTCACGCCC CATTTCAAAT AACGGCTCTT C TAAGGCAAT ATGACGATCA 270540 
TAACCTTGAA TAAAGCGTTC AATTAAATGT TTATCAATAT CTTGACGTTG TTCAGAAATA 270600 
AGCTCTTCTA ACTGCACAGA TAATGCATTC CAATTTTCAT GCAAATATTC GTGCTGTTTT 270660 
TCTAATTCAT CAATGGAAGT TTGTGTTTGC GGAGCCACTT TGACTAATTG AGGGAAAAAA 2 7 0720 
TCTAATTCTT CATCATCGTG ATGTAACGGT G CGGCACG AT TGAAATAGCT TAAAATCTGC 270780 
TCAACATCAT TTAATACAGC CTGAGTATAG C CGTGTTTTT CTAGATAATC AGGCAAAATG 270840 
CTTAACTGAC GACAAAAACG TTTTACCTTG CTATGACAGG CATAAAGCAT TTCAATAGGT 270900 
TCATTCCATG TGG CGAATTG TTGAGGTTCA AG AATTTG C A TAAGATTTCC TTTGTTAAAA 270960 
AGTGCGGTCA AAAAACGTTT TATTTTTCTA CCGCACTTTT ATTAATTATG ATTTATCCGC 271020 
TAATTGTAAT GGCGGAACAG GTTTGCCCAT GCGAGCATAG AATTCCACCA CAAAATCATC 271080 
AAAACGATCA TCTTCAATCG CTTGACGAAT TTCCGCCATT AAACGTTGAT AATAGCGTAA 271140 
ATTGTGAATG GTATTTAAGC GCGCACCTAA AATTTCACCG CATTTATCTA AATGATATAA 271200 
ATAAGCTTTG GTGTAGTTTT TACAGGTGTA GCAATCACAT TCAGGATCTA ACGGGCTGGT 271260 
ATCATCACGA TATTTTG CAT TACGGATTTT GACAATGCCG TCTGTAACAA ATAAATGGCC 271320 
ATTACGTGCG TTACGGGTTG GCATTACG CA GTCAAACATA TCAATGCCAC GACGCACGCC 27138 0 
TTCCACTAAA TCTTCTGGTT TACCCACGCC CATTAAATAA CGTGGTTTGT CAG CTGGG AT 2 7144 0 
TTGTGGGCAA ATATATTCTA AAATTCGGTG CATATCTTCT TTTGGTTCGC CTACCGCTAA 2 71500 
ACCGCCCACC GCATAACCGT CAAAACCAAT ATTGACTAAG CCTTCTAATG ATACTTTACG 271560 
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ATTCTTPPGP 


AG A A APTGCT* 


uLAL I I TCAA 


TAGAACGAAC 


AGAAGTAGTT 


CCCACTGCAA 


272820 


TTACACGTTT 


GPPPGPTTTT 


1 1 lbi 1 GCGA 


TAATCGCATT 


ACAGACTTCT 


TGAGAAACTT 


272880 


C CACAT ACTC 


PGP ATGP ATT 


7a rriTr" ta tptt 

■HA-o I GA it 1 x 


/"TV TV TV '■••*•<> im« 

CAATATTTTC 


CACACGAACA 


GGCTGGAATG 


272940 


TTCCTGPGPP 


1 X G X Aft X 


G TAAC AAATT 


CAAAATTCAC 


ACCCTTTGCT 


TTGAGTTTTT 


273000 


CAAGTAAATT 


TTCATCAAAA 


TGTAAGCCTG 


CTGTTGGTGC 


TGCTACCGCA 


CCTGGTACTT 


273060 


TACTATAAAC 


CGTTTGATAA 


CATTCTTGAT 


CGGCTTCTTC 


ATCTGGGCGG 


TCAATATAAG 


273120 


GCGGCAATGG 


CATATGCCCA 


ATGGTTTG C A 


ACACATCTAA 


AAGAGCGGTT 


GATTTATCAC 


273180 


TTAATTCCAC 


TTCAAACAAT 


G C AC C ATG AC 


GCGCTTTCAT 


TAC CGCTTTA 


ATGCCATTAT 


273240 
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TTTCGCCTAA TTTATCTTCC CCTAAAAATA ATTCAGCCCC TTCTTTAGGG GATTTCGATG 273300 
AACGAATATG CGCTAAGAAA CGATGTTCAT CTAACATCCG TTCAACTAAA ACTTCAATTT 273360 
TCCCACCGCT GGCTTTACGC CCAAACATTC TTGCAGGAAT AACGCGTGTA TTATTAAAAA 273420 
TCAACAAATC GCCTTCATCA ATTAAATCTA ATACATCAGA AAAAGTGCGG TGAAAAATTT 273480 
CTCCGTTTTC GCCATTTAAC TGTAATAAAC GACAAGAAAC ACGATCAGTT TTAGGATAAC 273S40 
GAGCAATTAA TTCATCGGGT AAATCAAAGT TAAAGTCTGA AACACGCATA ACAAAATTTA 273600 
GGTTAAATAA AATAGGCGGT AAGTCTAGTT TGAATTGAAA ATGATCTCAA GAAAAAATCT 2 7 366 0 
CCCCTCGGGG AAGATAAAAT AACCAGCAAT TATGCTGGTC ATTTGGGATT AATTATAAAT 2 7 372 0 
TTGCTTTTTT AGTTGCTTTT ACGTTTACAT CACAATTTTT AACGATGATG TTATCG CTAG 273780 
ATTTACCTTT TTGAATTAAA TTTACAGGCA CAACAGAATC AAATTTATCT AACGTTAAGC 273840 
CACTATCTAC ATTCCAAACA TAATCCCCAG AAACGAAACT TGTAAAGTCA TTTTG TGATT 273900 
TATCACGAGT AAAATCTTTC GCAATAATGC CATCGCCCAC ACTTACCATT GCAGCAACTG 273960 
GTTCTTGGTT TTCAAATTGA TACACAGCAG TCACCGTTTG TTTATTGCAG GAATATACAA 274020 
CTGTTTTATC AGTGATTGTT GGAAGTGTTT GCATTTCAGC TATTTGCGCA GATGATTTTG 274080 
TCATTTCAGG TGCATCCTTT GCACAAGCAG ATAATACGAC TGCAGCAGCA AATGCTGGTA 274140 
AAAGTTTAGT TAAATTCATT ATTAAATCCT CTTAATTAAT CCCCTATATG GGGTGCTACT 274200 
TAGATGCTCT TAATTAAATA AAGTTCCATT ATTGAAATAA AAAAGGTGGG ATCTGCCCCA 274260 
C CTTGTTTAT CAATAAACGA TACAATTATG TTGATTTTTA CCAACGATAC CCCAATTTCA 274320 
CGCCCACATT TTGCTGTTTG CCGAGTTGCG AACCTTGAGA TTTAGAAATA AAAGCAGAAA 274380 
GTTGGAAATG TAAAATTTCT GCCTTTAATC CCACTTCTTT TTGCCAATAA CGCCCAAATG 274440 
ATTGTkGCAA CATCGTGCTA TTTACAGTGG TTTGTACGTT AGCGTTTGAA ACATCAACAT 274500 
AATTGACAAA GAAATAAGGC TTAACGCTGA TATTATTTGT CGGGGTAAAT GTATAATCAA 2 74 56 0 
CTCGAATGCC AG C ATT AT AG CGATTAAATG CAAGGCTCGG TGTTTGCACT TTCACTTCTT 274620 
CAGATTGATA ATTTTCACGT TCAATAAAAT ATCGATTAAC ACCCAAATAA GGCTGAATAC 274680 
CCAATTGCCC TAAACGGAAC TGATAACTTG CATTCACTCC ATAATTTATC GCTTTTCGAT 2 74 74 0 
GAATTTTTCG GCTTTGTTCT TCAG CCATTT TACTCGCACT AATTCCCGCA CCCACATTTA 274 800 
CACCAAATTG TAAATCGCCC CATTGATATT GGGCAAAACC CGACATCATC GTTAATGTCG 274 860 
CGTGATTTTT AACCTGTTCG TCAAAGGTAT TATCTGAACG GCTATGCGAG AAAACCGCCC 274920 
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CAATTCGTCC ATTATCTAAG GCTTTTTGCA CCCCAATTTG ACGCAAGTnC GTTTTCTGCT 2 74 98 0 
GATAAGCACG GAACGCATCA GAATCATAGC GTCTTTTATC CTGTGCGATA TTTGTCCACA 275040 
GGGCAGATTG TGCTtGATCT ACAAAAAGAC GATCCAATTC ATCTTGAACG GAAAACATAC 27510 0 
TATTTACTGT TGCAGACAAC TCCGATAACG CACTATTTGA GTAACGGCTG ATCAATTCTT 275160 
TTTGTTTGCC TTGTTCCTCT TGATCTTGAG GTTCTTTTTT CACTTGCGGT TGGGCATTAA 27 5220 
TAACCTCAAG TTTGACTTCG AAT AC CTTT A ACTGGCTTAG ATCAAACGGG GTATCAGAAA 275280 
ACGCTGCACT CAATGCTCTT TTTGACCGCC CTTTTTTTGT TTTTGGCTTA CTTGTTTGTG 275340 
TTTCAGTAGT CTGTTCAACT TGTTTGGCTT CTAATGCGTT TAACAGGCTT TGAGCAGACG 275400 
GGGTATCAGA CAACACCGCT CTTCTTGACC GCGCTTTTGC CTTACTTTTT TGTTTTTCAG 275460 
CAGTCTGTTC AACTTGTTTG GCTTCTAATG TTTGTTCTGC TTGCTCTGCT CTTACTAAAT 27 5520 
CATTGAGCAA TTCCTGCTCT TTTATTGGGT TATGCAAGCG GAATTCGCCC TTATTCTTCA 27 5 580 
CTAATTTATA ACGTAATG C A CCTGCATCAA CGTGGTCATT TTCTAAAGTA AATTTG AG CT 275640 
TATCTGATAA CGGTTTATTA TCTAAGCTTT CAATCAAAGT TAATTGCTCA AGGGTCACAG 2 757 00 
GTTCTTTGCC TGTGTTGCGA ACAGATAATG TGTAATCGCC CTCAGCGTCA TTGGATAATT 275760 
TTAATTTATC GCTTTTATAG CCAAATAAAG ATGAAGTAAA TTGGAATGTG CCTTGCCCGC 2 7 582 0 
TCAATTTACC ATTTACTGTC AATGTGTTGA AACGATGTTC TGCCGATGTT GGCGTTGTTT 2 7 5880 
CCGTCTCTAA TGAACGGCGG TGACGTGGCG CATTATTTGA GCTAGCTGAA TAAGCTGAAT 2 75 94 0 
TTAACGTAAC AGTACTATTA TTTAGCGTTA AATTCTGCAA TGTAGTATCG CTAGGCATTG 2 76 0 00 
TCCAAGTCGC ATTTTCCAAC GTCACTGTTG TGTCTTTGTC GCCCTGAATT TGGTGCCAAA 2 76 060 
AATGGCTGTT TTTTAAAGAA AATTGAGCAG AATCCGTTAA ATGCACATTG CCGTTTAACG 2 76120 
TGGCATTATT TACCGTTGCA TTTGCGTGAT TTGAAAGTTT GATATTGCCT GTTTGGGTGG 2 76180 
CATTGTTG CT CAATGTAAAT TGGCTGTGAT CTATTAAAGT GACATTACCA TTAAGTTTTG 2 7624 0 
CTAAACCATG AATATTCACT GTTGCATTAT TAGTTAAATT AATAGAGCCA TTAATTTGTG 2 76300 
TTATGGGTAT GGAATTAATA ACTTTTGTAT CGGTTAAATT AACTGTTTTA CAAGTCGTTA 2 76 36 0 
ATCCTGTCCA ATCTGAACGC GTGCAAATGG TATTTTGTTA ATTTGGCACA ACACCAAATG 2 764 2 0 
TGGCATTTGC ATTATTGCTG ACTGTCCAAT TTCCCTCAAT TGAAGAAACA TTGCGAGAAA 2 76480 
CCACCGCACT TCCGCCTTTA ATTTGGAAGT TTTCAGCTTT AAATGTACGG TTGAwCCAAT 2 76 54 0 
CGTAATCCCA CACAATTTCG CCTTGTGGGA TACCTTCCAT TTCTGACCAA CGTTTGTCTA 2 76600 
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AATGATTGTA GGCGTGGGGT GTCGGTCTAC CGCTGAAAAA TAGTTTACCT TTTGTTTGGG 276660 
TAATATCGCC TTTTAAGTTT GTGCCACCTG AAAGTAGCAA AGTACGATCT TCTGTGGTTG 276720 
GTTTATAAAT AAGGTTTAAT CGTCCATTGT GTTTATTTTT ATCTGTTTCG CCAAACCAAC 276780 
CGTTGTAGGC AATTTCTTTG CTGTAATCAA GTTTATTAAT ATTCTTTTTA TTAGATGGAG 276840 
CAGTAATACT TTCGTTCCCA GTAATAGTAA TATTAGCGAC TTGAGTTGTA TTATGGTTCA 276900 
CAATCATCGC GCCCTCGTCC GTATTTTGGA TACGCTTAAA GGTTAATGAA TGCCCGTTAA 276960 
GATCTAAGCG ACCACCACGA AATCCGAAAT AGATATTGTT AGGATCAACT TGATCTGTAC 277020 
TATTTAATTG AACTGTTGCT CGACCGCTTA CAATGCCAAC TTCTTTGAAA GCTTGTTTTT 277080 
GACCCGCTTC ATCTGCTTGT TGATCTAGAA TGACTTTGCC GTTACCCGCA CTTAAACTTC 277140 
CTAAATTCTT TCCCTTACCA TTGACTAATA AAGTGCCTAT ACCAATTTTA GATAAACGAT 277200 
CATTTTCAGG ATTGTGAACT TTCCATTCAA CAGTTGCATC TTGTCCAATA G ATACG CCTG 277260 
CACCTTGCCA AGTTATATTA TTTTGCTTGC CTTTTACAAC AAAATTACCT TCAAAGTATA 277320 
ATCCACCAGC ACCTTGATTT ATATTATTTT CGATTGTTAA AGTTCCTTTT CCTTGGTCGC 277380 
CAAGGTAAAT ATTTTTTCCA TATTCCATTC TTGGTTGGTA AATATTATAG CCTGCTGCTG 277 440 
CTTTAACATG TTCTTTAGCG GTTTGATTTA ACGATGGATT AAATAACTTC ACAGTTCCTA 277500 
CTTCTGATTT TGCTTTAGAG CCATCTTTAC TAGGTCTAGT TAAAGTTAAT TTACCTGTAC 277560 
CATCATTATT TGATACAAAG GAATAATGTC CATTTATTGG GGGAATATAG CGTTGAAAAA 277620 
CACTAGGGGT ATCAACCGCA AGAACTTCAT TATAAAACCA TTCTTCACGT ATTAACTGAA 277680 
ACCCATTACC TCTTCCGAAA AAAGGATGCC CAGTTTGTAA TACAGCATTT ATAAGCCATT 277740 
GTTTCTTCTT CGCATCATAA ATAAACATTG GCGAACCGCT ATCGCCTTTA GAACCACCCG 277800 
TTGGTAATGG ACCATAATGA TTAGGGCTAA CT AC ATTAC C ACTAAGATTG ACTGTACCAT 277860 
TACCATTTCC ACTCTGAGTA TGGGTATTTC CTGCAGTAAG ATAACGATAT GCACCTGAGA 277920 
CATAATATGA ACTATGTACA TTCGTTTCTT CATCTTTATC TGTACGCCAA TACTGACGTC 277980 
CTGAGCCTAT ACGTACACGC TCAGGATAGT TCTCTCTATC AGCATATACT TTTCCATCCA 278040 
TATTTGTTGT CATACCCACA GGTTCAGCTT CAGTTACAAA TTTATGTAAA CGAGG CAT AT 278100 
GATAATCTCC ATCATAAGGA TGCTTTCTCT CCCAAGCTTG ATAATTATTT CTTTTTACAA 278160 
TTTGATAAGT AAAGCGGTGC TGATCAGGGT TTCGTCCTTC TGCACCAAAA TCAACATCGT 278220 
TATATCCGCC GTTATGTGCC ACGCTCACAA TATACTGATC GCCTACTAAT GCCGCCACGC 2782 80 
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CGTTACGCGA CACCACGGaA AAATCAATCA TCGGGGCTTT TGTCATTGAT GTGCCAACTA 278340 
ATTGCCCTTC TTTGTTATAA ACCTCAATAT TTTTAGCCCC AACTGTGAAC TTCCCTTTAT 2 78400 
TCTCGGCAAA ATCACGATAA TATTGGTAGT C AATC CC AAA ATAAGTATGA CCTGCCCAGG 278460 
CTTGTG ATG C TATCCCTAAT GAAACACAAG CGGTTAAAAA ATTAAGACGA AATACAGTTT 278520 
TTTTCATAAT GTTTATTTAA TTAAG T AATT TTTCGTATTA AAAAATACGT GGTTAAACGA 278 580 
CTATTGAACA ATAAAAGTGC GGTGATTTTC TAAGGGATTT CAGCTAAAAA ACGAACTAAA 2 7864 0 
AGGCACAAAA GAAAATAAAG CAGTAAAACT ATATAGTAAT TAAGCCAAAA TTTCTAGGTT 2 78 700 
TTGGCTTAAT ACTTAGTGGG TAATCTTATC TTG CGATGGA TTAATATTAT ATGTCATTTA 2 78760 
TACAGATAAC TAAAATTAAA ACAATATCAC TAATAAAATA ACCGCACTTT ATGTGAGTTT 278820 
tCTCTGAAAG TGCGATCATT TTTtCTAAGG TTTTTCTAAA ATCGGTTTAA GGAAGCGAGC 2 7888 0 
GGTGTGGGAA CTTGTTACTT TAGCAACTTG TTCTGGTGTA CCCGTCGCAA TAATTTGTCC 2 78 94 0 
GCCGCCACTG CCTCCCTCTG GGCCAAGATC GACAATCCAG TCTGCGGTTT TAATCACATC 2 79000 
AAGATTGTGT TCAATGACGA CGATAGTATT TCCTTGATCG CGTAATCGAT GCAATACTTC 2 79060 
AAGTAATTGC TTAATGTCAG CGAAATGCAA ACCAGTCGTC GGTTCATCTA AAATATATAA 279120 
GGTTTTACCT GTATCACGTT TAGAAAGCTC AGTCGCTAGC TTAACGCGTT GTGCTTCGCC 279180 
GCCTGAAAGT GTTGTAGAAG ATTGACCTAA TCGAATATAG GATAATCCCA CATCCATCAA 2 7924 0 
GGTTTGTAAT TTTCTTGCAA TCATTGGAAT CG CATC AAAA AACTCGCGTG CTTCTTCCAC 2 79300 
CGTCATGTCC AAAACTTGAT GGATGGTTTT ACCTTTGTAA CGAATTTCTA AGGTTTCGCG 2 793 6 0 
ATTATAGCGT TTACCTTTAC ATTGGTCGCA AGGAACATAA ACATCGGGTA AAAAATG CAT 2 7 9420 
TTCAACTTTG AGTACACCGT CGCCCTGACA GG CTTC AC AG CGTCCACCTC GTACGTTAAA 2 79480 
GCTAAAACGC CCCGGATTAT AACCACGCGC ACGTGCCTCT GGCACACCTG CAAAAAGTTC 2 7954 0 
ACGAATTGGG GTAAATAAGC CTGTATAAGT GGCTGGATTT GAACGTGGCG TGCGCCCAAT 279600 
CGGGCTTTGG TTAATATCAA TAACTTTATC GAAATGTTCT AATCCCTCGA TAGATTGATA 279660 
AGGTGCGTAA TCCGTCTTTT CCGCTCTATT TAACGCATTT TGCGCAAGTG GAAATAAAGT 2 79 72 0 
ATCATTAATA AGTGTGGATT TTCCTGAACC AG AC ACGC C A GTTACGCAAG TAAACAAACC 2 7 9780 
AACGGGAATA TCTAAATTCA CATTTTTTAA GTTATTACCT GATGCGCCTT TAAGTTTGAG 2 7 984 0 
CCATTTTTTC TTATCAAGTG CGGTGCGTTT TTTAGGAATT TCGATTTTAT CTGCGCCCGA 2 79 900 
TAAAAATTTT CCCGTGATGG AATTTGGATT GAGCATAATT TCATCGGCAT TTCCTTGCGC 27*960 
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AATAACTTGT 


CCGCCATGCA 


CGCCAGCACC 


AGGCCCAATA 


TCAATAATAT 


GGTCAGCTGC 


280020 


ACGAATCGCG 


TCTTCATCGT 


GTTCCACGAC 


AATTAC CGTA 


TTACCAAGAT 


TACGCAAATG 


280080 


AATTAACG T A 


TTAAGTAAGC 


GTTCATTATC 


ACGTTGGTGC 


AAGCCAATAG 


AGGGTTCATC 


280140 


TAATACATAC 


ATTACGCCAA 


CAAGTCCCGC 


AC CAATTTG A 


CTCGCAAGGC 


GAATACGTTG 


280200 


CGCCTCCCCA 


CCTGAAAGAG 


TTTCAGCAGA 


ACGAGAAAGA 


GAAAGATAAT 


TCAAACCAAC 


280260 


ATTGACTAAA 


AACTGCAAAC 


GCTCGCGGAT 


TTCTTTAAGA 


ATTTTTTCCG 


CAATTTGTGC 


280320 


TTTTTG AC CA 


GTGAGAGAAA 


GTGCGGTAAA 


AAATTCGAGG 


GTTTCGCCAA 


TGCTTTTTTC 


280380 


CGCAATTATC 


GGCAAATTGG 


TTCTTCCAAT 


ATACACATTA 


CGCGCTTCGG 


GTCGCAAACG 


280440 


AGAACCGCCA 


CAATCTATAC 


AAGGTCGATT 


GCTAATATTT 


TTCG CTAATT 


CTTCACGCrC 


280500 


CGACATTGAT 


TCCGTTTCTT 


TATATCGGCG 


AG CCATATTA 


TTCAAAATCC 


CTTCAAAAGG 


280560 


GTGCTTGCGA 


ATAACCACAT 


CGCCACGATC 


ATTCATATAT 


TGGAATTCAA 


TTTCCTCTTT 


280620 


GCCTGAGCCA 


TGCATAATGA 


TGTGTTGAAT 


TTTCTTTGGC 


AAAGATTCAT 


AAGGGGCTTC 


280680 


AACATCAAAA 


TGATAATGTT 


TCGCCAATGA 


TGTAAGCATT 


TGATAATAAT 


AGAAATTACG 


280740 


ACGATCCCAA 


CCTTTTACCG 


CACCACCAGC 


AAGAGAAATA 


GTTGGATTTT 


GCACCACACG 


280800 


ATCTTCATCA 


AAATATTGCT 


GCACACCCAA 


GCCATCACAA 


GTTGGG CAAG 


CACCTGCAGG 


280860 


ATTGTTAAAG 


GAAAACAAAC 


GAGGCTCTAA 


nTCTGGCACA 


GAATAACCAC 


AATGCGGACA 


280920 


AGCAAAATTT 


GCCGAAAAAA 


CTAATTCTTC 


TGCTTTCGGA 


TTATCCATTT 


CTGCCACAAT 


280980 


TGCAGTGCCA 


CCTGAAAGCT 


CTAATG CGGT 


TTCAAAAGAC 


TCTGCTAAAC 


GTGTTGCTAA 


281040 


ATCTGACCGC 


ACTTTAAAAC 


GATCAACCAC 


TACTTCAATA 


GTATGTTTTT 


TCTGTAAGGC 


281100 


TAATTTTGGC 


GGATCAGATA 


AATCGCAAAT 


TTCGCCATCA 


ATACGCGCAC 


GAATATAACC 


281160 


TTGCGCAGCA 


ATATTTTCTA 


AAATCTTAAC 


GTGTTCACCT 


TTTCGATTTT 


TGACAACTGG 


281220 


CGCAAGTAAC 


ATCATCTTGC 


TGTCTTCTGG 


CAAACTTAAT 


ACTTTATCCA 


CCATTTGACT 


281280 


AATTGTTTGT 


GCCGTTAATG 


GAACATTATG 


ATCGGGACAA 


CGTGGCTCCC 


CTACTCGTGC 


281340 


AAACAATAAA 


CGTAAATAAT 


CATAAATTTC 


CGTAATTGTT 


CCCACCGTAG 


AACGTGGATT 


281400 


GTGTGAGGTA 


GATTTTTGTT 


CAATCGAAAT 


TGCAGGGGAA 


AGTCCCTCAA 


TAGAATCCAC 


281460 


ATCAGGCTTT 


TCCATTAAAG 


ATAAAAACTG 


GCGTGCATAC 


GCCGAAAGAG 


ATTCAACATA 


281520 


GCGGCGTTGC 


CCTTCCGCAT 


AAAGTGTATC 


AAAGGCTAAA 


GAGGATTTTC 


CCGAACCTGA 


281580 


AAGCCCAGTA 


ATCACCACAA 


GTTTATTGCG 


TGGAATAGTT 


AAATTAATAT 


TTTTCAGGTT 


281640 
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ATGGGTTCTA GCCCCGCGAA TATCGATATT TTCCATAAAA TAACtCAAAA GTATGAAATA 281700 
AATGTAAATT TTGCGATCAT TAT.CGCATAT TTTAATATCT GTG CAAATAT CCAGTTAAAA 281760 
TATATAGATC TTTTAAAAAA TTTCAGG CAA AATAACCGCA CTTTTTTAGA TAAAAAATGA 281820 

GGTTTTTTTA TGGCTGGAAT TAATAAAGTC ATTATTGTAG GTCATTTAGG TAATGATCCT 281880 

GAAATTCGTA CTATGCCAAA CGGCGATGCC GTTG CAAAT A TCAGCGTAGC aACCAGTGAA 281940 

AGCTGGAACG ATCGTAATAC TGGCGAACGC CGTGAAGTAA CAGAATGGCA CCGCATTGTA 282000 

TTCTATCGTC GTCAAGCAGA GATTTGCGGC GAATATTTAC GCAAAGGCTC TCAAGTGTAT 2820S0 

GTGGAGGGGC GTTTAAAAAC GCGTAAATGG CAAGATCAAA ACGGTCAAGA TCGTTACACC 282120 

ACAGAAATTC AAGGCGATGT AATGCAAATG CTTGGTGGAC GTAATCAAAA CGCAGGTGGT 2 8 2180 

TATGGTAATG ATATGGGCGG TGCGCCACAA TCTTCTTACC AAGCTCGTCA AACTAACAAT 2 822 4 0 

GGTAATAGTT ATCAATCATC TCGCCCAGCT CCACAACAAG CTGCACCACA GGCTGAGCCA 2 82 3 00 

CCAATGGATG GATTTGATGA CGATATTCCG TTCTAAATTA CATAGATTGA AAAGTAAATG 2 8236 0 

ATAAAAATCA AAGAGAAGCT GAATAAAAGA GAGCTTCTCT TTTTATTTTA CTAATTAAAA 2 8242 0 

GGACTAAACC TATGATTATA GATTTAGTCC TTAATTTTAT AGCTTTAGTT TTAGGGAAGA 2 82480 

TTAATTGCTA CCATTTATCT TCCATATCAC ATTTTATTGA AGAGTAAAAC TAATTTGCAC 28254 0 

ACTTAAACTC GAAGGAAATC CTGCGGGTCT TGTTCCAACA GAGCGAGATA CGTTAATAGC 282600 

GTCTAATGCC GCCTTATCTA AACTTTCATC GCCTGAGGAT TTTGTAACCT TAGCACCACT 2 82 660 

TAACGAACCA TCAGCTCCTA CATTAAAAGA TACACTCACT TTTCCTTGTT TGCGCATTAT 282 72 0 

CTTCGCTCGA GTTGGATAAC GTTTATGGCT TTCAATTTCA CGGCGGATTG CAGAACGGTA 2 82 78 0 

AGCCGCTATT TCACTCGTAT CCGTtCCACT TCCTGCTACC TGTGCATTGC TATTAGCTGC 282840 

GCTTGTnGTA CTTG CTTTAT CATTTACGTT GGCACTTGAA TCAATATTTT CATCGCCTTT 2 82 900 

TGGTAGCTCT TTATTGATTG GTTTCTTTTG TGGTTTAACC TCTTTTTTAG GTTTATt CTT 28-2560 

CGGCTTTCCT TTTGGTTTTC CTTTGGGTTT TGGCTTTtCC TTTTCTGGTT CTTTGATTTT 2 83 02 0 

TTTCGGTTCA GGTTTTATTG TTGGaTCTTC TACAATTTCT TGTTTTTCTG GTTCTGGCTC 283080 

TTTTTGTACA TTTCCTGGCT CTGGCTCTGG CTCTGGCTCT TTTTGTACAT CTTCTGGCTC 2 8314 0 

TGGAGCAGGT TCTTCCAACA CCATGCCCTG TAATAGCTCC ATAGAAATAC TTG TAG ATAT 2 83200 

ATCGCCTTGT GCACTATTTG CGCTATCACT TGGCTCATTC CAATTCCATA AG ATAAATC C 2 8326 0 

TATAACAATA CCGTGTGCGA TCAAAGAAAT AAGCAAACCC AATAGCGAAC GTTTTGTTTG 2 83 32 0 
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CTGCATAATT ATTCCTTACG GCGTAGTTGG TACAGATTGG GAAGGAGTAG ATTCTTGAGA 283380 
ATTTTTACCC GCACTTTTTC CCTTATCTTT CATAGAAACA ATTGCCACAT TTTTGATTTC 283440 
ATTTTTAGAA AGCATATCTG TAATCGTCAC AAAATCTTGG AAAGAGGCTT CGGCATCAAT 283500 
TTTTAAAGTT ACTTTCTGAT CTTTATTC CA TTGAGCAATT TCTGCCTCTA GAGCTTCTTG 283560 
AGAAATAGGT TTATCATTAA AATAGAG TTG TTTATCCGCT GTCACTGTCA ATAATTTAGC 283620 
TAGCTCGTCA GATTTAAATG CGACCGCTGT ACTAG CTTTT GGTACATTTA CTTGAATTTT 283680 
ACCTTGAGAG ATAAAAGATG CAGTAATAAG CACCACAGTA AGTAGCACCA ACATAATATC 283740 
AATGAAAGGG ATAATGTTGA TTTCATCGAA TTTTTTCACG AATTATTCCT TATCTTTTTG 283800 
TTCGCTTAAT ACTTTCCATT TTAAACGATT AACTTCAACT TTACGCCCCA AACCGTTATA 283860 
AAACACCATA GAGGGAATAG CCACTAGAAT ACCTAATGCA GTTGCTTTTA ATG CTAAAG A 283920 
TAAGTGCAGC ATAATTACAC TTGGaTCAAT GTCGCCACCG CCGTGACCGA TTTGATAGAA 283980 
AGTTAATAAA ATCCCGATAA CTGTACCAAG TAATCCAACA TAAGGTGCGT TTGCACcGAT 284040 
GGTAGAAATC ACTGTCATAT TACGGTTTAA ATCAATATCT AATGCGTGAA TAGTAGAATA 284.100 
ATGTGCCACG CTTACCTTTC TTAAAAAAAG ATAACGTTCA ATTACCATCG CAAGCATAAT 284160 
AATGCTCATC AGTAATAATA AACCGATGAT AAAATAATCA CTGTACTGCT GTAAAAAATC 284220 
AAAAAGTTGT ACCATTTTTT TTCCTTGTTA TTTGAGAAAT TTCTTTGAGA ATATAATTGA 284280 
GAATGAATTT CAATCATACG GGGGATTTTA GCAAAGGCTA AATAATGCGT AAACCTTGAA 284340 
TTGAAAAATT TCTGAATTTT TTACCGCACT TTATGTGGAA AAGTGCGGtT ATTTTTTGAA 284400 
ATATTATAGG GAACGAAGAT AATCAATGAT CATTTGATGA TGGTCTTTTA TCTTAAATTT 284460 
ATCAAATACT TGCATAATAG TTCCGCTTTC ATTGATTAAA AAACTAATTC GATGAATACC 284520 
GTCGTAAGTT CTCCCCATAA ATTTTTTTTC GCCCCATACG CCAAATTGCT CCGCAACTTG 284580 
ATGATCAGGA TCAGAAAGCA ATGTGAAATT AAGTTCTTTT TTCTCAATAA ATTGAGCTAA 284640 
TTTTTTCGGG GCATCAGGAC TAATACCAAG TACAACTAAA CCTAGTACAT CTAATTCGGA 284700 
TTTACTGTCG CGCAATCCGC AGGCTTGTGT GGTACAGCCT GGAGTGAGAG CCTTTGGATA 284760 
AAAATAAATT AAAACTTTTT TACCACGAAA ATCACTTAAA GAAACAAATT TTTCTTGTTG 284820 
ATTTAATAAG GTAAAAGCAG GTGCTTGATT ACCAACAGAT AATGGATTCA TAAAAAACTC 284880 
CTTTAATGAA AAAAAATTTG CAAATTAACG TTATTTTAGC GACTATAACA AGTGAAACCA 284940 
AATAATAAAA TTAAAAACCG ACGGAGGTTA TATGTCAGCG CAAAATTCTT TATTTTCGGG 285000 
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TAGCATAGTT GCCCTTGTTA CGCCAATGAA TCATTATGGC GAAGTCGATT TTTCCTGTTT 28 506 0 

AGAAAAATTA GTAGAACACC ACATTGAAGC TGGTTCTAAC GCACTTGTTT CTGTGGGTAC 285120 

AACGGGAGAA TCCGCCACGC TAAGTATTGA AGAAAATGTA AAAGTCATTG AAAAGACTGT 2 8 5180 

TGAATTTGCC AAAGGACGTA TTCCTATTAT TGCGGGAGCG GGTGCAAATG CGACGAGTGA 2 8 524 0 

AGCAATTACA ATGACCAAAT TATTACGAGA CAGTGGTGTA GCAGGATGTT TATCTGTTGT 28 53 00 

GCCTTATTAT AATAAAC C AA CCCAAGAGGG AATGTATCAA CATTTTAAAG CCATTGCGGA 2 8 5360 

ATGTACTGAT TTACCACAGA TTCTTTATAA TGTTCCTAGT CGCACAGGCA GTGATATGAA 28 5420 

ACCTGAAACA GTTGCTCGTT TAGCTGAAAT TGAAAATATT GTAGGCATTA AGGAAGCAAC 28 5480 

GAGAGATGTT AGCCGAATAG TTAAAATTAA GCAGCTTGCA GGTAAGAaTT TTATTGTTTT 28554 0 

AAGTGGTGAT GATGCAACAG GCTTAGAAGC AATAAAATTA GGTGCTGAAG GAGTAATATC 2 8 5600 

AGTAACAAAT AACATTGCAG CAAAAGATAT GGCTGATATG TGCCGTTATG CACTTGCGGG 2 8 566 0 

AGATTTTGAT AAAGCGGAAG AAATTAATGC TCGTTTAATG CGTTTACATC ACGACCTATT 28 5 72 0 

CATTGAATCT AATC CAATTC CAGTGAAGTG GGCAGCTTAT CGCTTAGGCT TAATTAAATC 2 8 578 0 

GCCACATTTA CGCTTGCCAC TTACAACACT TAGTGAAGAA ATTCAACCAA AAGTAGGAGA 2 8 584 0 

TGCGCTwAAA ATCGCAGGAT TAT T AT AAAA ATAAAGGGGT AAGTAATTAC CCCTCACTTT 285900 

TATGTATTAT AGATCAGTAA AATGAT C AAA AAGTGCGGTC AGAATTTATC AAATTTTCAA 28596 0 

TTTAGTTAAG GTTAATATAT GAAAAAAATA ATATTGAATT TAGTAACGGC TATTATTCTA 2 8 602 0 

GCTGGTTGTT CGTCTAATCC TGAAACCTTA AAAGCAACCA ACGACTCATT TCAAAAATCT 2 8608 0 

G AGACG AG C A TTCCCCATTT TTCCCCTTTG GCTACTGGAG GGGTACAGTT AC CAAAAG CT 28614 0 

GATGATTCCT ATTCATTACC TAATATTGAA GTGAAAAAAG GGGAAGATAT TGATATTCGC 2 8 62 00 

C CAC C ATT AA TCCCTCTCGC TATTATTCAA AATTCAATAA CAAAATTTGA TGGCGAACGC 2 8 626 0 

TCGTTGATTG TCTATCCAAA ACaACAGGCA AAACTATATA ATCTTCAGCA AGTTGAGCGT 2 86 3 20 

TTATTGAAAG AAGAGGGTAT TTCATCAACA ACAGATGGTT CGATTTTAAC GACTGATTGG 286380 

GCAAAAACAG AACGAATTGG CGATAAATCT ATAGAAATAA AATATCAAAT AGAACAAGTT 2 86 44 0 

ATGACAGCTG ATGTAAGTGC GCTTACCGTA TCTATTTTAC ATATGCGCCG AGATGGTATT 2 86500 

ATTTTTACAC CAAATGTATC AGATAAACAA TATTATACTT CGGAACGTTT AAATCGCATT 2 86 56 0 

GTTTTAACAC TCACTACCGC ATATAACAAA CAATTACGGG ATTTATCAAG CACTTTAATT 2 8662 0 

CAATAGGGCG TCATATAATA ATTTCTCTAA AATAATAAAT GTTTAAAAAT GAGACATAGA 2 86680 
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TCATCCTTTT G AAAATT AC T CTGATATTTA TCTTTAAAAC ACTTGAAAAT CTAAAATAAG 286740 
AACACTATTT TGTTATTAAC TAGAATTTAA ATCACGCAAC AAAGAGGTAA ATGCTATGAC 286.00 
AC TTAATATT ACCAGTAAAC AAATGGACAT CACTCCTGCA ATTCGTGAAC ACTTGGAGGA 286860 
ACGCACTTGC GAAGTTAGGT AAGTGGCAAA CGCAGTTAAT TAGCCCACAT TTTGTTTTAA 286920 
ACAAAGTAC C TAATGGATTT AGTGTCGAAG CCTCTATTGG CACACCCTTA GGCAATTTAT 286980 
TAGCCAGTGC AACTTCAGAT GATATGTATA AAGCAATTAA TGAAGTGGAA GAAAAATTAG 287040 
AACGTCAGCT TAATAAACTT CAACATAAAA GTGAATCTCG CCGAGCTGAT GAACGTTTAA 287X00 
AAGATTCTTT TGAAAACTAA ATACTGTAGA TAAAATCTAA TCTGGCAGTC TTATTGTAAA 287160 
ATAATATGTC TGTCAGATTA ATTTTAGATG AAACAATTAA ATCACTAAAA ATTGATGATA 287220 
TTAACTCAAA ATATTAATAA ATTTTATTAA ATATAGATTA ATTTAACCTA TCCTGCACGG 287280 
CGATGTAAAC TTCAGTACAT TCACGCTTAT CCACTACAAG AACATAAACG ACAACTTTAT 287340 
CTTCAATAAC TTCATAAACT TAAACAATGC C CTAAGAG CG AAATTTTATT T t ATAAAACA 287400 
CACCAAAAAA TCTAAAAAAC CGATCGG CAA ATACGTATGA nTTTCCTAAG CGTTTGATTA 287460 
ATTTTTTCTT AAATTGCTTT TAATAAAAAT AGAGACTAAT GAAAGATACA GATTAAATTT 287520 
ATATTTTTTA ACGAAATAGA TATTATTTGA CGGGATTAAT GGCGGTGAGA AAGGGATTTG 287580 
AACCCTTGAT GCAGTTTCCC ACATACACGC TTTCCAGGCG TGCTCCITCA ACCACTCGGA 287640 
CATCTCACCG TAAAAGGTGA GCAAACTATA AAGATTTACT CTGGTTTAGT CAAGTTTTTT 287700 
ACTCTAAAAA ATAAAAATCT TTAAAAGATT AGTTAAAAAA TAGGCAATAA AAGCTATTGA 287760 
GCTAATTCAG AGTTAGAAGA TACTAATAAT TTAGTAAGAT GTAACCTTGA TAATTATTAA 287820 
AACAGTTGAT TTTAATAGAT ACCATATATT AATTaTCTTT TATTCTCTTT CTTAATCTTT 287880 
TTATTTTAAT TAAAAAAGGG ATTTtCTCAA AATGATCATA CCAAGATGAT GGTATATCAA 287940 
TAATTTTATC CATTTCTTTT AAAAGTAAAT TTCCTATATG ACAATTTAAG TGACTG CATT 288000 
TTTTGTGCCA AAATTTATTT GGACCACAAT AATGAGAAAT AACTATTGGC ATTTTTACGC 288060 
ATAATAAATT TTCt TTTTAA TCAAATCTCG ATCCGTTGGT GTAAAATTAA ATCTATTATT 288120 
AATAAATTT t ACTTTTCCTT T AC AG AT AC C ATTTAGAATA TCTtGATCTT GGTATTTCAT 288180 
AACATTATTG TATTTATCAT CCAATTTATT GATTTTTGGa AAATATTTTC TTCTTTCCAT 288240 
TTATTTAAAT TAATTAATAA TATGCCCGCA TTAAAATAAG AATATCCTTC TAAACCAATT 288300 
GTTTTTTTAT AAGCTTCATT TTTTACATCA ATAAAAGTAT CTCGACATGC TGCCAAATAA 288360 
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TAGTtCGTTA 


TATCTATATT 


CCAAAGTTCT 


TGAAGTGAAG 


AGTTTGTTAA 


TGTGTCCACA 


288420 


TCAATATAAA 


TAGCTTTTTC 


TATATTCTTT 


ATGTATTTAG 


TTAAATTTAG 


TCTAGCGTAA 


288480 


GTAGCCAAAG 


ATATATAATC 


TATTGTTTTT 


GGAAAGTTTT 


GAAAATCTGA 


CTCACAAACG 


288540 


GGCAAGAAAA 


ATACTTTACA 


AGAATATGCG 


CTTGCTAAAT 


TAT TT ATT AT 


GGTCTTATTT 


288600 


TCCTGATTTA 


TTTTCATATC 


AAGAnTATAA 


AAATTTATTT 


TTTTAGGTGT 


ATTTTTAATA 


288660 


ATACTAAAAA 


TACTTACAGC 


TAAATAAGGG 


GCATAATAGT 


GGTCAGACGA 


AAATATAATA 


288720 


TTCATTGTCT 


GACTGACAGT 


CCGTCCGTCT 


GTCTGTCTGT 


CTGTCTGTCT 


GTCTGTCTGT 


288780 


CTGTCTGTCT 


GTCTGTCTGT 


CTGTCTGTCT 


GTCTGTCTGT 


CTGTCTGTCT 


GTCTGTCATG 


288840 


AAATTATCTC 


TGATTAACGT 


GTGAATTATT 


ATTTTTTGGG 


TTATTTTTTT 


GATAGATTAA 


288900 


ATAATTTTAT 


TTACTTTATA 


ATTTACTTTT 


T AAAAC ACT C 


AATG C CTTTG 


CTCGATGAGA 


288960 


AATTTTTTTC 


tTCTCAGCTG 


TTTCTAATTC 


AG C AAATGTG 


CAACCTTGTT 


CTGGACTAAA 


28902 0 


GAATAAACTG 


TCATAACCAA 


AGCCATTTTC 


GCCTTTTTCT 


TCAAAACCAA 


TCACGCCACA 


289080 


ACACTCACCT 


TCGGCGATAA 


TTGGCGATGG 


ATCCGTTGGA 


TGCTGTAAAA 


ACACGATACA 


289140 


GCTCACAAAT 


TTGGCTTGGC 


GTTGTTCTTG 


TGCAATATGA 


GCAAGCTCTG 


CCAATAATTT 


289200 


CTCACGGTTT 


TTAGCATCAT 


TGCCCTCTTC 


AC CAGCATAA 


CGGGCAGAAT 


AC AG AC CTGG 


289260 


TGCGCCATTA 


AGTGCGGAAA 


CTACTAAGCC 


AGAATCATCT 


GCAATAGCAG 


GTAAACCTGA 


289320 


TTTCTCTGAG 


GCATAACGGG 


CTTTTAATAA 


GGCATTCTCT 


ACAAAAGTTA 


AGCCTGTCTC 


289380 


TTCTGGGCTT 


TCAATGCCTA 


AATCTGTTTG 


TGCAATCACT 


TCAAAGCCAA 


AATCAGATAG 


28 9440 


GACATCAGCC 


ATTTCTTTTA 


CTTTTCCTTT 


ATTGCCTGTA 


G CAAGTACG A 


TTTTTTGTTT 


289500 


CATAATTAGt 


C CTTTT ATTG 


ATGATAAGCT 


GACGTTAAAT 


CAGCCCAGTT 


TTGCTCGGTA 


28956 0 


AATTGAATAT 


TCATTCGTCC 


AACTTCTTTT 


TCGAAGGAAC 


GTTTTAAACG 


GTTCAGATTT 


289620 


TGCACTTTCC 


AAAAATCTGC 


AGATTTTTCG 


CCGCACTTAT 


CAAAATCCAG 


TAACCAACAT 


289680 


TTTTGCCCTT 


GTTCAGTTTG 


TTG AAG C AAA 


ATATTATGTG 


CATTCAAGTC 


TGTATGGCAA 


28 9740 


ATCTGTAAAT 


CATGTAGTTT 


TCGAATCAAA 


CGACCAATTT 


GCATCCAAGT 


TTCTTTTGGC 


289800 




1 I I(j T AAAAG 


TGCGGTTAAA 


TCCTGCGCGT 


TTTCGATTTT 


TTCCGTCAAA 


289860 


ATATCCGCTT 


GATAGCAAAT 


ACCTAACTTG 


CCTTTTTGAA 


TACGTGCGGC 


AATGGGTTTA 


289920 


GGCACAGGTA 


ACCCCGCTTC 


ATACAAACGT 


TGCAACAAAT 


GAAACTCAGC 


AAAACTACGG 


289980 


GTAGTTTCAA 


GAG C AG AAAA 


ACGATAACGA 


TCTTTGTTCA 


GTTTACCCCA 


AAGTCCGCCG 


290040 
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CGATAATAAT GACGCAACGC ACAATTTACA CCAAACCAAT CTTCAGTTTG CAAAAAATAA 290100 
GTGATGCCTC GACCTTTTGC ACTTCCTATC ACTCG1TCTT GTTTTTGCCA AAATTCCGCT 290X60 
TGGAAAAACT CGGTTGCTTG CTCAAAAGTG CGGTCAAAAT TAAAGATAAA ATATTGGTTA 290220 
TCTTGTTGGA ATTGGTGCAT TGGAACGTTG TTATGAATAG AAAATGGACT TATTCTACTT 290280 
TAAAATAAGC ACAAATTAAA TGATAAAGGA ACGTCAATGC CACTTTTTAC CGAAGCTCCA 290340 
AAATCCCTTT GTATTCTCCG ATTGTCAGCG GTAGGCGATG TCTGTCATGC CTTAG CGGTG 290400 
GTTCAGCACA TTCAAGAATA TTATCCCCAA ACAGAAATGA CTTGGATTGT AGGAAAAACC 290460 
GAAATGGGTT TGCTTTCAAG TATTCCTAAT ATCACGCTTA TTCCTTACGA TAAAAAAACA 290520 
GGATGGAAAG GCGTTTTATC ACTTTGGAAA CAACTAAAAA ATAAACAGTT TGATGCGCTA 290580 
TTAAATATGC AAACAGCATT TCGTGCATCA ATCCTTTCTT TAGGAATTAA AGCAAAATTC 290640 
AAAATTGGAT TTGGAGAAAA ACGTTCTCGC GAAGGGCAAT GGCTATTTGT AAATCGTCGC 290700 
ATCAGAGATC CTTTTTCCCC TCACGTATTA GATGGATTTA TGGCTTTCGC TGAATATATT 290760 
GGCGTGCCGA AAGCAGAACC GAAATGGGAA TTAGCTATCT CACAAGATGA TTATAAATTT 290820 
GC CGATCAAT TTATTGATTT CTCTCGTAAA AATTTATTAA TTTCCCCTTG TTCTAGCAAA 290880 
GCAGAAAAAG ATTGGCTAAT CGAAGGTTAT GCAGAGGTTG CCAATATTGC CCATCAGCAT 290940 
AATATCAATG TAATCTTCTG CAGTTCACCA GCCAAACGTG AATTAGAAAT AGTCGAAAAA 291000 
ATTACCGCAC TTTGCCATTT CACACCAACC AATATTGCAG GGAAAACAAA C CT AAAACAG 291060 
CTTACCGCTT TAATTTCTAA GGTTGATCTA GTACTTTCGC CAGATTCAGG CCCTGCCCAT 291120 
ATAGCGACAA CACAAGGTAC ACCCGTTATT GGCTTGTATG CCTATCACAA TCCTTTGCGT 291180 
ACTGCACCTT ATAATAATTT AGATAACGTG GTATCAGTAT ATGAAGaAAA TGCCCAAAAA 291240 
GAATTTGGCA AACCGTCATC TGAACTACCT TGGGCAACGA AACTAAAAGG GAAAAATTTG 291300 
ATGACAGAGA tTCAAGTGGA ACCCGTCATT GgACAAATGa AAAAATTGGG GTTATTTTGA 291360 
AAAAATAATC TGTATCTCAA AACTTAGATG GTTTTAATAT AGCCAACTTG AAAGTTGTTT 291420 
TTGGAAAGTA AGGACTTTAG TTTTTTTAAG TTaAAAATCA ATAACTTGCT GTCAAAATAA 291480 
TTATATTTAG TCATAAAAAT CTATACCTTA ATCTCTCGCT TATTTAGTCC AATTTTTGCA 291540 
GGTACAGATT GAAATCCAAA ATTAGTTTGA ATTTTACGCA ATTAAATTCA AATGATTATA 291600 
AATAAACCAA TTGATTATTA TTATTATTAT TATTATTATT ATCGCCATTC TCCTCTTTTT 291660 
AATTATTACA ATAAGGAAAA CAGATGCGAT TTTCTAAACT TTCCCTTGCA ATTACAACAA 291720 
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CCTTAGTGAC 


AGCAAATGCG 


CTAGCGCAAT 


CCGTTGAATT 


AGACTCTATC 


AACGTTATTG 


291780 


C G ACACG AG A 


yCCAAGTAGG 


TTTGCTTATA 


CGCCAGAAAA 


ACAATCTAAA 


GATAGTCTTC 


291840 


TTTCTAAGCA 


AG CG ACTAGT 


GTTGCAGATG 


CGTTAGAAGA 


CATTCCCAAT 


GTTGATGTTA 


291900 


GAGGCGGTTC 


GAGAAGCATT 


GCTCAAAAAC 


CTAATATTCG 


AGGGTTAAGT 


GATAATCGTG 


291960 


TTGTG CAAGT 


CATTGATGGC 


GTGAGACAAA 


ATTTTGATTT 


AG C AC AT AG A 


GG t TCTTATT 


292020 


TTCTTCCAAT 


GTCACTCATC 


CAAGAAATTG 


AAGTAATCAA 


AGG AC CAAGT 


AGCTCCTTAT 


292080 


GGGGTAG CGG 


TGCATTGGGT 


GGTGTTGTGG 


CAATGCGTAC 


GCCAAATGCT 


TTAGACTTAT 


292140 


TGAAAAATAA 


TGACAAATTC 


GGAG TTAAAA 


TTCGCCAAGG 


TTATCAAACT 


GCTAATAATT 


292200 


TATCGGAAAA 


GGATGTTTCT 


GTATTTGCGG 


CAAATGACAA 


ATTCGATGTT 


CTTATTAGTG 


292260 


GTTTCTATAA 


TAATGCGGAT 


AATTTACGCA 


CTGGTAAAGG 


CAACAAGCTA 


AATAATACCG 


292320 


CCTATAAACA 


GTTTGGGGGC 


TTAGCAAAAT 


TCGGTTGGCA 


AATTAATGAT 


GCGAACCGCG 


292380 


TGGAATTATC 


CCACCGCGAA 


ACTCGTTTTA 


AACAAACAGC 


ACCAAGCAAT 


AATGAGGTGG 


292440 


AAAACGAACT 


TACTAATGAA 


CAAATTACAG 


ATCAAATCAA 


AAAGTTCCAC 


GGACAAAAAG 


292500 


ACGATCTTCT 


TCCTCCTACA 


ACACAACCAT 


CACCATCAGA 


AAGATCAGAG 


TTTTACTCTA 


292560 


AAGTGAAAAC 


ACGTTTAGGT 


AGTGTCAGTT 


ATTTAACTGA 


TCAACAAATT 


CCTGATCAAA 


292620 


GCACGGTATT 


TAACTATTAT 


TTAACGCCAG 


ATAATCCTTA 


TCTAAATACG 


CATATCGCAC 


292680 


TGTATAACAA 


TAAAACTATT 


GAGAAAGAAC 


AGCGTAAAGT 


CAGTGGTGTG 


AAAGATCAGA 


292740 


CTAAATTGAC 


TAC CCGAGGT 


ATAAATTTAC 


GTAATTCTTC 


CGAATTATCT 


CACATTTCCT 


292800 


TTGTTTATGG 


GGTGGATTAT 


ATGCGAGATA 


AAATCCGTAC 


CGAACGAGGC 


ACAAACGGTA 


292860 


GCGATGCGAA 


GTTTCGAG CG 


GACCCCTATA 


ATGCGAATTC 


AAACACTACA 


GGCGTTTATT 


292920 


TAATCGCCCA 


TATTCCGCTA 


TTTGGGGAAA 


AATTGCTAGT 


TTCGCCAAGT 


GTACGTTATG 


292980 


ACCACTACGA 


TACCTCAAGT 


AAAACCGTAA 


AATACAAGGA 


TAATCATTTA 


TCTCCTGCCA 


293040 


CAAAATTAAC 


TTGG AT AG TG 


ACCAATTGGT 


TAGATTTTAC 


TGCCAAATAT 


AATGAAGCTT 


293100 


TCCGAGCACC 


ATCTATGCAA 


GAGCGATTTG 


TGAGTGGTGC 


TCACTTTGGG 


GCAAATACTC 


293160 


TAGGGCTAGA 


TCACATCAAT 






TTTGCGCCCT 


GAAACAGCGA 


293220 


AAAATAAAGA 


AATTACCGCA 


AATCTACATT 


TTGATAGTCT 


GTTTAAACAA 


GGCGATAAAT 


293280 


T C AAAATTG A 


AGCGACTTAT 


TTCCGTAATG 


ATGTGAAAGA 


TTTTATTAAC 


TTAAAAATAT 


293340 


TTAATGATGC 


AAAGACAAGT 


GCAAGTGCAG 


GTGCAAATCC 


AAATACAAAT 


GGAGCATTGT 


293400 
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TGCCAAAAAA TTCCCAGTAT CAAAACATAA CTAATGCCCG TTTAAGCGGT ATTG AATTG C 293460 
AAGCTCAATA CCAAACAGAA CGTTTAACGC TATTTACTAA CTATGGCAGC ACCAAAGGTA 293520 
AAGATAAAGA TAGTGGCGAA GCTTTATCAA ACATTGCCGC AAGCAAAATC GGCGTAGGGG 293580 
TAAATTATGC TTTAGTAAAA GACAAATTCA CGGTGGGAGC GACAGTAACC CATTACGCTG 293640 
CTCAACGCCG AGTGCCTAAA GATCATAGTG TTACCTACCC AAGTTATATA CTGACCGATC 293700 
TTCGTGCTAC CTATGCACCA TTAAAAGGCG AATGGAAAAA CCTGCGTTTA GATTTTGCAC 293760 
TGGAAAACTT GTTTGATAGA AAATATCAAC CCGCGTTTAG TTTAATGGAA GGCACTGGTC 293820 
GTAATGCGAA AATTAGTGCG GTTTATAGCT TCTAAATTTA CTTTGGAAAA CGCCCTAAAG 29,880 
CTGTTTTC C C TCAATTTTTG TAATTTTATC AATTAATAGG ACATAAGGAA TGAAAATGAG 293940 
ACCTCGTTAT TCTGTTATTG CAAGTGCGGT CAGTTTAGGC TTTGTTTTGA GTAAATCAGT 294000 
GATGGCATTA GATCGG CCAG ATACTGGATC ATTGAACCGT GAACTAGAAC AACGTCAAAT 294060 
TCAATCAGAA GCCAAACCTA GTGGCGAATT ATTTAATCAA ACGG CAAATT CGCCTTATAC 294X20 
TGCACAGTAT AAACAAGGAC TGAAATTTCC ACTTAAGCAA GTACAGATTT TAGATCGAAA 294180 
CAATCAGGAA GTCGTCACAG ACGAGCTTGC ACATATTTTA AAAAATTATG TAGGAAAAGA 294240 
GGTATCGCTG TCAGATTTAA GTAATTTAGC AAATGAGATC AGCGAATTTT ATCGTCATAA 294300 
TAATTATTTA GTGG CAAAAG CAATTTTACC ACCGCAGGAA ATTGAGCAAG GTACGGTAAA 294360 
AATTTTGTTA CTTAAAGGTA ATGTAGGCGA AATACGCTTG CAAAATCACA GTGCATTAAG 294420 
CAATAAATTT GTATCCCGTC TCTCTAATAC AACAGTGAAT GCGTCCGAAT TTATTTTGAA 294480 
AGATGAGTTA GAAAAATTTG CTTTAACAAT AAACGATGTT CCAGGGGTAA ATGCTGGTTT 294540 
GCAACTCTCC GCAGGCAAAA AAGTCGGCGA AGCGAATTTA CTGATTAAAA TTAACGATGC 294600 
AAAACGTTTT TCTAGTTATG TTTCAGTAGA TAACCAAGGC AACAAATATA CAGGGCGTTA 294660 
CCGTTTGGCT GCGGGGACAA AAGTCAGTAA TTTGAATGGC TGGGGCGATG AATTGAAACT 294720 
GGATTTAATG AGTTCTAATC AGGCTAATTT AAAAAATGCT CGTATAGATT ATTCCTCTTT 2 94 780 
AATTGATGGT TATTCTACGC GTTTTGGTGT AACTGCTAAC TATCTCGACT ACAAATTAGG 294840 
TGGTAATTTT AAATCATTAC AATCTCAAGG GCATTCCCAT ACCTTAGGGG CTTATTTGTT 294900 
GCACCCAACA ATCCGTACAC CAAATTTTCG TTTAAGTACT AAAGTGAGCT TTAATCATCA 2 94 960 
AAACCTTACG GATAAACAAC AAGCGGTTTA TGTGAAACAA AAACGGAAAA TAAATAGTTT 2 95020 
AACTGCAGGG ATTGACGGTT CTTGGAATCT TATTAAAGAT GGGACAACTT ATTTTTCATT 295080 
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ATCTACCTTG TTCGGTAATT TAGCGAATCA AACTAGTGAA AAAAAACAGT ATACAAAAGA 295 140 
GGATTTTCAA CCACAATCGC ATTTTACGGT ATATAACTAC CGCTTGTCTC ACGAGCAAAT 295200 
TTTACCAAAA TCTTTTGCTT TCAATATTGG TATAAACGGT CAGTTTGCCG ATAAAACCCT 2952C0 
TGAAAGCTCA C AG AAAATG T TGCTTGGTGG TTTGTCTGGA GTACGTGGAC ATCAAGCTGG 29S320 
TGCGGCTTCT GTGGATGAAG GACATTTGAT ACAAACAGAA TTCAAACATT ATTTACCAGT 29S38 0 
C TOC CCAA AGTGTGCTAG TTAGTTCATT GTTCTATGAC TATGGTTTGG GAAAATATTA 295440 
TAAAAATAGC CAATTTCTCG AAAAAGGCGT G AAAAAT AG C GTGAAATTAC AAAGCGTTGG 2955 00 
TGCAGGACTT TCTCTTTCAG ATGCTGGCAG CTATGCCATC AATGTCAGTG TCGCCAAACC 29556 0 
GCTTGATAAC AATATTAATA ATGCCGATAA ACACCAATTT TGGCTTTCTA TGATTAAAAC 295620 
TTTCTAAGGA GGTATGCAAT GTATAAATTA AATGTAATTT CTCTTATCAT TTTGACCACT 2956 30 
TACACTGGTG CGACATATGC CTCTG CACGG GATTTGCCAC AAGGTAGCAG TGTAGTTGTT 295 740 
GGGGAGGCAA ATGTTTCTAC AATTGGAAAT AAAATGACGA TTGATCAAAA AACCCCAACA 295 800 
ACTCAAATAG ACTGGCATAG TTTTGACATC GGTCAAAATA AAGAAGTGGA ATTTAAACAG 295 860 
CCTGACGCAA ATTCTGTGGC ATATAACCGT GTAACTGGCG GCAATGCTAG TCAAATCCAA 295920 
GGGAAATTGA CGGCAAACGG TAAAGTTTAT TTAGCAAACC CTAATGGGGT TATTATTACC 29598 0 
CAAGGGG CAG AAATTAATGT AGCTGGTTTA TTTG CAACAA CCAAAGATTT AGAGAGAATC 29S0 40 
AGTGAAAATG GCAATGGCAA TGGCAATAAA TTTACCCGTA AACTAAAAGA CGGTCAGGTG 296100 
GTAAAAGAAG GTCAAGTTAT CAATAAAGGG AAAATTAAGG CGAAAGATTT CGTGGTACTT 296160 
AATGGCGATA AAGTTATCAA TGAAGGGGAA ATTGACGCTA CAAACAATGG CAAAGTATAT 2 96220 
TTATCCTCAG GCTATAATTT T AC ATT T AC C TTATCAGATA GCnGCATAAG TGTGGCATTA 296280 
GAAGATAATG CAGTACAGAG TATTGTTCAA AATGAAGGGA TTATTAAAGC AGGGGACATT 296340 
ACGCTAAATG CAAAAGGGCG TAATCAAGCA TTAGATAGCT TGGTTATGAA TAATGGTGTG 295400 
TTAGAAGCAA CTAAGGTTAG TAATAAAAAC GGCAAGGTTG TGTTATCTGC TGACGATGTT 2964 60 
CAGTTGAATA ATAAAAGTGA TATTAAAGGG GAGAGTGAGG TTGTATTTAC TAACGAACCT 29652 0 
AAAAACAAGA TAAAAATCAC TTCTCAAACT GGCTCAAAAG TTACCTCGCC TAAAATCAAC 296580 
TTTACAGGCA AAAGCGTTAA TATCAATGGT GACTTTGGAC GTGACGATAG TAAAGCGCAC 296640 
TATAACGAAG AGCATAAAAG ATTAGATACA GAAGTTAATA TTGACGTGCC AGACAATGAA 296700 
AATATTCGTA TTGCGGAGAA AGATAATACG GGCACGGGTA CGGGTACGGA TTCATTTATA 296760 
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CAAACAGGGG CGTTAAGTTC ATTACTAGCT AATAATGGAA AAGTGAATCT TAAAGGAAAG 2 96820 

GACGTTAATA TTT CAGGACG TATTCATATA GATTCCTTTA G AGG CTCAGA TTCGCTATTA 2 96880 

AAGCTAACTA AC CAAGGTC A TATTAAGATC AATCATG CGG ATATTCATTC AACAGGTCGT 2 96940 

TTATTTTTTA TAACCAGTTT GCAGAATGAG AAAGATTCTC AATCAGATAT TACCATTACT 2 97000 

GACAGCAAAA TAAATCTTGG TAACGGAGCT ATGGGATTGG GGCGTAGTCT CGATAAAGAA 2 97 060 

AACTGCGATA AC CAAAG ATG GTGTAGAACG GAAACCTCAC AACGCAAGAa GTTTGATGTT 2 97120 

CATATGCGTA ATGTAGTATT TGATCAGGTT GATGATGTGG TTGTAGCAGG TGGATTTAAA 2 97180 

AAAGTTAATT TAGATAACAT CGTTGCCACA GGAAAAACGA ATTTTTACAT TGATGGGGGG 2 97240 

GTATCCCGTA ATAATTCAAG ATATGAATAC GGTGTGTTAG ATTTAGACAA ACGTACGCTG 2 973 00 

TTATCGGAGT TAGATCAACG CAGACGCCGT TGGAAATATT ACAATGATTT AGATTTAGAT 2 973 6 0 

ATGAATAAAG CTTATTGGCA TCGTTTTGAT ATGTTTGCCA CTAAAAATAC AGGG CGATCA 2 97420 

ACTATTAAAG ACACAGAAAT TAATATTTCT AACAGTAAGA TTAATCTGAA AAATGGATTT 2 97480 

GTACATTTAC TGG CAGAAAA AATTAAATTA GATAATAGCA AGATTGATAT TACTTTTGAT 2 97 540 

AAAGATAATT CACAAGATAT TTCAACTCAA ATTAACCGTT TAGGAATGAA TGGTAAAGTG 2 97600 

TCAATGGTAA ACTCACATAT TAAAATCGTG GGCGATGAGA AAAGCGATAT ATCTGCAAAA 2 97660 

GCACCTTATG CTACGATGTT TTTAATCGGG GAATTAATAG GGGAAAAAAG TTCTATTTTT 2 97 720 

GTGAAATCTC ATCAGGGTTA CACCTTTAGA ACTGATGGTG ATACAAAAAT TG C AGGT AAA 2 977 80 

AACAGTAAAG ATGACTTGAA AATTACCGCA ATTAACACTG GAGGAAGAAC TGGGAAAGAA 2 9784 0 

GTGATCATTA ATGGTGCGCC TGGCTCAATT GATAATGATG CGAATATTGC CAATATGGCA 2 97 900 

TTTACTATTG GTGATAACGC AAAT AC C AAA ACAACCATTG AAAATGCCGA CATCACTGCC 2 97 96 0 

TTAGC AC CAA ACGGTGGGAC AG CTT ATTTA AGCTCCAAAG GGGTAGAAAT TGAGGTAAAC 2 98 020 

C CAAATTC AA ATTTCACATT CTTTGAACTT CCTCGTGAGA AAAACTTTAA TCAAACAAAA 2 98 08 0 

ATTAAAGGCG ATTCTACCAA ACTCAGTGAA CGTGGTTTTG CTAGACTTTA TGACAAAATT 2 9814 0 

AACGGTGTGC GAGCCAGCAA CTTGTCTGCT GAGCAATTGA ATGTGACGGA CGCAAGCGAA 2 982 0 0 

AAAATCATTA ATACAAAGTT AGTGAGTAGC TTAGATGTTG AAAAACTTGT TTCTGTTGCA 2 9826 0 

GTTTGTGATG CAGGTAAAGG TTGCGAAGAA CAACAATTCG GCGATAAAGG TAATAATACT 2 9832 0 

AAAGTGAGTG TTGGAGAGTT AGAAACTGAA CAATAACTAT TCAGAACACA AG GTAAAAGC 2 983 8 0 

ATAAAG CCAA TCTGACAGAC ATCAATTAAA TTGCATGCCT GTCAGATTTA TTAATAGCCA 2 9844 0 
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CTATCGCTAC ATTATTTTAA ACAACCACGA ACAATCAATA CGCCAACATT TCGAGCTTGA 2 985 00 
GCGACGGCTT TTGGTTTACT CAGTTTAATT TTTAATCCTT GAAGTTGATA ACGTGACTCA 298 560 
AGCAAATCCG CCACTTCATA AGCCACTCGC TCAATC AG CA AGAAAGGTTT GCTTTCCACA 2 98620 
TAATCAATAA TCGCTTGACT CACTTCAGCA TAATTTAAAC AATATACAAC ATCATCCGTT 298680 
TCTGCCGCTT GTTTGCAATC CCAAGCCATT TCCAAATCAA ACACAAGTTT TTGCTTTATT 2 93740 
TGCTGTTCCC AATCATATAC ACCGATTTGT GCAAAAACAG TCAATTCTTC AATAAAAACA 298800 
CGATCCATAA GGTTTCCTCT AAAATTATTG TAGGCGTATG CTATCAATTT TGGGTAGAAT 298860 
AACGAACAGT TTTTCTTGCT GGAGTGCAAA ATGAGCCTTT TCGCCCTTTT TTATATGCTT 298920 
TTTGCCTATC TTTTAGGCTC AATTTCCAGT GCGATTTTAA TTTGTCGCAT TGCAGGCTTG 2 98980 
CCAGATCCTC GTCAAAATGG TTCACACAAC CCTGGGGCGA CAAATGTGTT GCGCATTGGA 2 9904 0 
AATCGTAAGT CTGCCTTGGC GGTGCTTATT TTTGATATGT TAAAAGGGAT GATTCCTGTA 2 99100 
TGGGCAGGGT ATTATTTAGG TTTAACCCAA TTTGAATTGG GAATGGTCGC ACTTGGTGCT 2 9916 0 
TGTTTAGGGC ATATTTTCCC AATTTTCTTT CAATTTAAAG GTGGCAAAGG TGTTGCAACT 2 99220 
GCCTTTGGGG CAATCGCACC GATTTCTTGG GCTGTTGCAG GTTCAATGTT TGGTACGTGG 2 992 80 
ATCTTTGTTT TTCTTGTTAG TGGCTATTCT TCATTAAGTG CAGTTATTTC TGCATTACTT 2 99340 
GTGCCTTTTT ATGTGTGGTG GTTTAAACCT GAATTTACTT TTCCTGTTGC ACTGGTATGT 2 99400 
TGCCTATTAA TTTATCGTCA TCACGATAAC ATTCAACGCT TATGGCGTGG TCAAGAAGAT 2 99460 
AAAGTTTGGG CTAAATTTAA AAAGAAATAA AAATTTTACC GCACTTTTCT AAATTTCCAC 29952 0 
TGGAGTTTCC TGTGTATACT AAAGGTTCCG TTTCCACTCG TATTGCGAAA TATTTATTTA 2 99580 
TCATTCTTAT CGTTGCTGGT GTGATTAGTT CTCTCAGTTT GGCAATTATG AGCAGTAATA 2 9964 0 
AATATGATGC CGAAGCCATT AATATTTCAG GTTCTTTGCG TATG CAAAGT TATCG CTTAC 2 99700 
TTTATGAAAT GCAAGAACAA CCAGAAAGCG TGGAAACGAA CTTACGCCGT TATCATATAA 299760 
GCCTTCATTC ATCTGCTTTA CTAGAAGTTC AAAATCAATT TTTCACACCG AATGTATTAA 2 99820 
AACATTCCTA TCAAAACATT TTGCAACGTT GGACGAACAT GGAAAAATAT GCGCGCCAAC 2 99880 
AGGATGTAAA AAACTATAGT AAGCAGCTTA CTGACTATGT GGCAGATGTT GATTATTTTG 2 9994 0 
TGTTTGAATT ACAACGTTTT AG CG AG CAG A AATGGATTTT AGGCGTTTCA GTATTAGGTT 3 00000 
TTGCGATGTT GTTAATTTTG CTGATGGTGT CTTATGTGAT TTGGTATACA AAC CG AG AAG 30006 0 
TTGTGAAACC TTTGCATTTA ATGACAAAAG CCAGTATGCA AGTGCAAATG CGTCAATTCA 3 0012 0 
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ATCATATTCC ACTTGACACA AGGAAACAAA ATGAACTTGG CACACTGGCT CGCGTATTTA 300180 
CACAAATGTC CACAGAACTG GGACAGCTTT ATTCGCGTTT AGAAGAAGCG GTAAACGaAA 300240 
AAACGCAAAA GTTG CGTCAA ACTAACCGCA CTTTAAGCAC TCTTTATCAA AGTGCTCAAC 300300 
TTCTGAATAC AAACACAATT AACGATAAAA TTTTAAATCA AGTTTTAAAT TACATTTTTA 300360 
TTAGCGACCA TTTGAACTTC GTTAAAGTGG AAGTGATGGG GGCTGAACAT TGGGATATTA 300420 
CATTGGGTAA GCAAGATGCT AATAATGAAC TACAAATCGA AACCCTTTCT GTGGATAACG 300480 
AAGAATTAGG TGTACTCTCT TGGCAAGCAG GATTACCTTG TCCAGATCCC CGAATTATG C 300540 
AAAATTTAGC GCAAATGCTT GCTCGTGCGC TTTATTTCCA TAAAAATTTA CGTCAGAAAG 300600 
AACAGCTTTT ATTAATGGAA GAACGCTCAA TTATTGCGCG AGAGTTGCAC GATTCTTTGG 300660 
CTCAAGTCTT ATCATTTTTA CAAATTCAAC TTACTTTGCT CAAACATAAT CTGAAGAAAG 300720 
AAGACGAACA ATCAAAAGAA AAGAGTTTGG C AATTATTG C CAATTTTGAA CAAGCCCTTT 300780 
CTGGTGGTTA TGCACAGTTA CGTGAGCTTT TAGCCACTTT CCGTTTAACC ATACAAGAAG 300840 
CGAATTTACA GCTTGCACTA AAACAAGTGA TTGATTCATT ACGCTCGCAA ACAACAATGC 300900 
AAATGAATGT GAATTGTCAA TTACCATCG C AAAGTTTGAA CCCTCAACAG CTGGTTCACG 300960 
TTCTGCAAAT TGTACGTGAA GCCACCACCA ATGCGATCAA GCATTCTCAA GGCACGGTAA 301020 
TTGAAATTAG CGCAAGAATT AATGCGGAAG GCGAGTATGA AATTTTAGTT GAAGATGATG 3 0108 0 
GGGTTGGCAT TCCAAACCTT GAAGAACCTG AAGGGCATTA CGGCTTGAAT ATTATGG CGG 301140 
AACGTTGTCG TCAGCTTAAT GCACAATTGC ATATTCATCG CCGTGAACAA GGCGGCACAC 301200 
AAGTCAAAAT TACCTTGCCA CATACATTAT ATTAGGACAT CAAAATGCAA AATTTACAAC 3 01260 
CTTTTCACAC TTTCCACATT CAGTCTAATG CTCGTGAAAT TATTGAAGCA CATAGTATCG 301320 
AACAGCTACA ACAAGTGTGG GCGAATTCTA AATCAGAAAA TTTACCCACT CTTTTTTTAG 301380 
GTCAAGGAAG TAATGTTCTA TTTCTAGATG ATTTTAATGG CATCGTGAtC CTTAATCGTT 301440 
TAATGGGGAT TACGCACGAA CAAGATGCAA ATTTTCACTA TCTTCACGTC AATGGCGGGG 301S0O 
AAAATTGGCA TAAATTGGTG GAGTGGTCTA TTAATAATGG TATTTATGGC TTAGAAAATC 301560 
TGGCTCTGAT TCCCGGTTGT GCAGGCTCTG CCCCCATTCA AAATATCGG T GCTTATGGCG 301620 
TTGAATTTAA AGATGTATGT GATTATGTGG AGGTACTTAA TTTAAATACG AACGAAACCT 301680 
TTAGACTCGA CACAGAACAA TGTGAATTTG GTTATCGTGA GAGTATTTTT AAACATCGTT 301740 
ATCAGCAAGG TTATGTGATT ACAGCGGTTG GATTAAAGCT AAAAAAAGAT TGGCAGCCTA 301800 
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TTCTAAAATA TGGTTCGCTT GTAGAGTTCG ATCCGAAAAC GGTCACCGCT AAACAAATTT 3 0186 0 

TTGATGAAGT CTGTCATATC CGTCAAAGCA AATTACCTGA CCCAAATGAA GTGGGTAATG 3 0192 0 

CAGGGAGTTT CTTTAAAAAT CCCGTTGTCA GTTCAGAACA TTTTGAAGAA ATCAAAAAAC 3 01980 

ATCACGAAAA TCTACCGCAC TTTCCACAGG CAGATGGCTC AGTGAAATTA GCGGCAGGCT 3 02 04 0 

GGCTGATTGA TCAATGCAAT CTTAAAGGTT TTCAAATTGG TGGGGCTGCG GTTCATAAAA 3 02100 

AACAAGCATT AGTATTAATT AACAAAAACG GGGCAACAGG GCAAGACGTG GTCAAACTCG 3 0216 0 

CCCATCACGT TCGCCAAACT GTTGCAGrAA AATTTGGTGT ATATTTACAA CCTGAAGTAC 3 02 22 0 

GATTTAT CAG TGCaACTGGC GAAGTAAATA GTGAGCAAAT TATCACGTAA TTTTATAATA 3 02280 

ATTGAACTTT CAATTGATAA AATCTGTCTT ATTAACAAAT TGGTTCAATA AGATAGCTTA 3 02 34 0 

TTGTGCTATA ACTAGGTACA TTGATTGGCA AAAGCCAAAA TAAAGAAGGG AAACGATGGA 3 024 00 

TAAAGAAACA CAAATGATGT TAGTTCCTCA AGGCAGTATC GAa GGTT ATA TTCGTGCAGC 3 0246 0 

AAATGAATAT CCGATGCTGA CCGCAGAGGA AGAAAAGGAA TTGGCAGAAC GTTTGTATTA 3 02 52 0 

TCACGAGGAT TTAGATGCAG CGAAAAAACT CATTTTGTCG CATCTTCGTT TCGTGATCCA 3 025 80 

CGTGGCGCGT GGCTATTCTG G C TATGGTTT GCCACAAGCT GACTTAATCC AAGAGGGCAA 3 0264 0 

TATCGGTTTA ATGAAAGCGG TCAAACGTTT TAACCCTGAA GTTGGTGTAC GCTTGGTGTC 3 02 7 00 

ATTCGCTGTG CATTGGATTA AAGCTGAGAT C CACGAATAT GTTCTTCGTA ACTGGCGTAT 3 02 76 0 

CGTAAAAGTC GCAACCACGA AAGCTCAACG TAAATTATTT TTCAACTTAC GTAAAACTAA 3 0282 0 

ACAACGTTTA GGTTGGTTCA ATGAAAATGA AGTTGATATG GTCGCAAATG AATTGGGTGT 3 028 8 0 

TTCCAAAGAA GATGTGATCG AAATGGAATC TCGAATGAGC GGTGCGGATG TCGGTTTTGA 3 02 94 0 

TTTACCCACT GATGATGCAG AAACAGAGAC CTATTCTCCA GCCCTTTATT TGGAAGATAA 3 03000 

AAGTTCTAAT TTCGCAGCAG AGCTGGAAAA TGAAAACTTT GAAAGCCAAG CAACCGAGCA 3 03 06 0 

ATTAGGTGCG GCGTTACAAA GTCTAGATGC ACGTAGTCAG GATATTATCA AAG CTCGTTG 30312 0 

GTTAGATGAT AACAAAGCCA CATTGCACGA CTTGGCAGCA AAATATAATG TCTCTGCGGA 3 0318 0 

ACGTATTCGT CAATTAGAAA CTAATGCACT GAAAAAATTA AAAAGTGCGG TGAATTTTTA 3 0324 0 

AATAATTTAG AAAAAGAAAA CCTGCTGATT AGCAGGTTTT TTATTTCTCA TTTAACCGTA 3 03 3 00 

ATTTCAAATC TTCAGCAGTT TGGCAAGTCT TAATCTTATC AAACACTTGG TTCGCTTCAT 3 03 36 0 

TATATTCCTT ATTCAAATAA CGTAACCATT GTTTAATTCG TGCCACATGn TAAAAACCGC 3 0 3420 

TGTCATATTC ATTTTCGACA TTCGCATATT TTtGCAAGAT TTTTTGAATC TCATACCAAG 3 034 8 0 
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GCATTTTTTC TGCATTTGAT TTCAGAACAT GACTTAAATT CGGAATATTC AACGCGCCTC 303540 
GTCCGACCAT TAAATCCTGA CAACCTGTTT GGGATAAGCA ATCTTGCCCA TCTTG CCAAT 303600 
GCCAAATTTC TCCGTTAGCA ATAACTGGAA TGGATAAACG CTCTCGGACT TCGCTAATTT 303660 
TTTTCCAATT AATTCGATCA GCACGATAAC CATCAGCTTT TGTACGTCCA TGAACTGTAA 303720 
TTTCGGTTGC GCCGCCTTGC TCTACCGCAT CAGCGATTTC AAAGGCTTGA GAAATATCAT 303780 
CCCAACCTAA CCGCACTTTT ACACTAACGG GAAATTCACT TGGCACGGCT CTGCGTAAGG 303840 
CTTGAGTGGC ACGATAAATC AATTCAGG CT GTTTCAATAA TGCCGCACCA CCATTGCTGC 303900 
CATTCACTGT TTTAGAAGGG CAACCACAAT TTAAATCAAT GCCATGAGAA CCAAGATCGA 303960 
TCGCACGAAT TGCATTTTCA GCAAGGCATT TTGGATGCTG CCCTAGCAAC TGsACTCGCA 304020 
CAGGCGTGCC AGAAGAAGTA AAGCCCTGAT TTTTTAATTC AGGG CAT AAA CGATAAAATA 304080 
CTTTTTCAGG GAGAAGTTGA TCAACTACGC GAACAAATTC TGTTATACAT AAATCGTAGT 304X40 
CATTCACTTC GGTGAGAAGT TCACGTACAA AGGG ATC TAA AACCCCTTGC ATAGGCGCAA 304200 
GAATAACACG CACTATTTCC AGCCTTTTGC TTTACAAATT AAATCGTAAG CGGCTTGAAT 304260 
CTGTTGAGTT TTTTCTTTTG CCATTTCCAT CATTTCTGGC GGTAAACCTT TCGCCACAAG 304320 
TTTATCTGGA TGATGTTCAT TCATTAAACG ACGATAAGCA CGCTTAACCG TGCTTTGCTC 304380 
GTCGGACTCA GTCACACCCA ATACTTTATA GGCATCATTT AAAGTTGGGC CTGAAGCATG 304440 
TTGGTAACCA CCACTATTTT GTTGTTGATA TTGGTAGCCA CCTTGATAGG CACCTTGTTG 304500 
ATATTTCTGA TAAAAACCGC CTTGAGTAAA AGCTCGAGCA GCCATTTCCA TCGCAATCAT 304560 
TTGTTCAAAT TGCATACGAG AAAGACCAAG TTCTTCAGCG ATCACATAAA GCACTTCTTT 3 04620 
CTCATTTTCG TGAAGTTCTG AATCGGCAAA AGCTGCTTGA ACTTGCACTT GTAAGAACAT 304680 
TCTCAATAAA TCAGCACGTT GCCCACAACC AATGCGAAAT TCACGGATAA CTTGACGAAT 304740 
CGGGAAATCA GATTCTTTAC CACGACGAAA CGCATCTTGC GCTAATTTAC GCCCAGCATC 304800 
ATCCAGTTTA AG s TGAATCA TTAATTGATT AGCTAATTGG ATATCTTCTT CTGTTACACG 304860 
CCCTTTCGAT TTGCTTAAAT GCCCTAACAC CGCAAAGGAG GTTTGCATAA ACAAGTCTTG 304920 
ACGCGTTGTT TTCTTTTTAA AGAAACTAGn ACTTACCGAG CCAAGnTCAT ACAATTTTTT 304980 
ATCTGCAATA GAGCCTAAAA TG AG AC C AG C AATCG CAC C A AAAAATCCAC CAACTTTCCA 3 0504 0 
ACCTAAAAAT ACGCCTATAA TTTTTCCAAT AAATTCCATT CTTTCCTCTG TTAAGTCAAA 305100 
GTGCGGTTAT TTTTATAATG TTATTTATAA TGATGATAAA AATTTATTTC TCAAGTTTTA 3 05160 
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TACGCCAAAT TCGGCACGAT AAGCACGCAT TTCTGGCAAA TGGCTACTGT ATCGAGGATC 305220 
TTGTTCGATA AATTGCATCA AATCATCTAA ATCAATGATT GAT AG CACTT GGCATTGATA 3 05280 
ATCACGTTCT ACTTCTTGAA TTG CTGAAAG CTCGCCTTTt CCTCGTTCCT TACGGTTTAG 3 05 34 0 
AGCGATCAAT ACCGCCGCAA GTTCTGCCTG ATTTGCACTG ATTAATTCCA TAGATTCACG 3 05400 
AATTGCAGTG CCAGCCGTAA TCACGTCATC CACAAGCAAA ATTTTTCCTT GCAATGGGCT 3 0546 0 
ACCAATTAAA TTGCCGCCTT CTCCGTGATC TTTCACTTCT TTG CGATTAA AACACACGGG 3 05520 
TTTATCAATA CCATAACGAT TAAATAACGC CACCGAAACT GACGTGCCAA TTGGAATACC 3 05 58 0 
TTTGTAAGCA GGGCCAAAAA CCACATCAAA ATCTACCGCA CTTGCTTGAA TTGCCGCAGC 3 0564 0 
ATAAAACTCG CCTAAACGCG CTAAATCTGC CCCCGTATTG AACAAACCTG CATTGAAGAA 3 05 700 
ATACGGACTT TTACGCCCCG ATTTCAAAGT AAACTCGCCG AATTTCAATA CATTTCGGCT 3 05 76 0 
TAAAGCAAAT TCGATAAAAT CGCGTTTATA TTGTTCCATT TTTTAACTCC TATAAACTGT 3 0582 0 
GTTTATAAGC CTAACGCTTC GCGTTGAGCG ATAAAAATTT GATT AC AG CC TTGTTTCGCT 30588 0 
AAATCTAATA ATGTTAATAA TTCTTCATGG CTAAATGGCT CGCCTTCTGC AGTGCCTTGC 305940 
ACCTCAATCA TGCGACCATC TTCCATCATC ACAACATTCA TATCGGTTTC CGCTGCAGAA 306000 
TCTTCCACAT ATTCCAAATC ACAAACAGCT TGACCATCAA CAATCCCTAC AG AAATTG CC 306060 
GAAACTAGCC CTTTAATGGG GTTAGTTTTT AACGTGCCAT TCTCAATTAA ACCATTGATC 306120 
GCATCACATA ATGCGACTGC AGCACCAGTA ATGGATGCAG TTCGTGTGCC GCCGTCCGCT 306180 
TG Aa T AAC AT CACAATCTAA AGTAATCGCA CGTTCACCCA GTGCTTTAAG ATCCACCATG 306240 
GCGCGCAAAG AGCGGGCAAT TAAGCGTTGA ATTTC CATTG TGCGTCCGCC TTGTTTACCT 306300 
TTAGCCGCCT CACGTTGCAT ACGACTATGT GTTGAACGTG GCAACATTCC ATATTCAGCC 3 063 60 
GTTACCCAAC CTTGCCCTTG TCCTTTTAAG AAACGTGGCA CAGCATCTTC CACTGTTGCA 3064 2 0 
GTACAAAGCA CTTTGGTGTC CCCAAATTCA ACAAG C AC AG AACCTTCAGC ATGCTTGGTG 3064 80 
TAGTTACGGG TAATTTTAAT TTGGCGGGGT TGATTATTTT CTCGATTATT TGGACGCATA 306540 
ATTATTCCTT GATATGGTTT CGAACGATGT AATAAAAAAG CGCACTATTC TAGCACGCTT 3 066 00 
TAGTTTTTAT TTTTGTTAAA ATTCGCTAAG ATATTGATTA TTGATAATAA AACAGCAAGG 3 06 660 
TGCTTTACCT TGCTTTATTA ATTTGGTGGA GATAATCGGG ATCGAACCGA CGACCTCTTG 306720 
AATGCCATTC AAGCGCTCTC CCAACTGAGC TATATCCCCA TAAAAGACAG CGACATGATA 3 06 7 80 
AAATTTGACT ATGTGGCTGT CAATACGATT TTATTATGTG TTTGTTATCC GATTAAATAT 3 0684 0 
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TAAGCGTTTT GTGCATGAAT AAAATCAATA GCACGCTGAA TACGCGCAAG TACGCGATCT 3 06900 
CTGCCGATAC CGACTAGGGT AACATCCATT GATGGCGATT GACCAGAACC TGTCACCGCG 306960 
ACACG TAATG GCATGCCAAC TTTAC CCATG CCCACTTCTA GTTCAGCAGC TGTTTGCTCA 307020 

ATGGCTTCGT GAATAGAATG TAAATCCCAG CTAGAAAGTG CGGTTAATTT TTCTTTCACT 3 07080 

TTTGCAAGCG CTTCAGCAGC ATTGCCTTTG AAATGTTTTT tCGCTGCGGC TTCATCAAAG 3 0714 0 

GTTTCAAACT CTTCAAAGAA GTAGCGGCTT GACCTTGCCA TTTCTTTTAA GGTTTTGCAA 3 07200 

CGTTCCGCAA GCATGGTGAC TATTTCAGTT AATGCAGGGC CATTGCTTGT ATCAATGCCT 3 0 7260 

TGATCTTTGT AATG CCATTC AAGG TGTTTA GCCACATATT CTGGCGGTAA TTCACGGATA 307320 

TAATGTTGAT TCAACCATTG TAATTTTtCG GTATTAAATG CGCTAgCTGA TTTGCTAACA 3 07380 

TGATCTAATT CGAAATAGTT GATCATTTCT TCGCGACTAA AAATTTCTTG GTCGCCATGT 3 07440 

CCCCAGCCTA AACGAACAAG ATAGTTAATT AAAGCTTCTG GTAAGTAACC ATCATCACGA 3 07 500 

TATTGCATTA CGCTTACTGC GCCATGTCGT TTAGAGAGTT TTTGACCATC ATCGCCATTA 3 07 560 

ATCATAGAAA CGTGTGCATA GGTTGGAATT GGGGCACCAA TCGCTTTTAA AATGTTAATT 3 0 762 0 

TGACGAGGCG TGTTGTTGAT ATGATCTTCC CCACGCACAA CATGTGTGAT ACCCATGTCC 3 07680 

CAGTCATCTA CAACTACACA GAAGTTATAA GTTGGAGAAC CGTCTGTGCG GCGAATAATA 3 07740 

AGATCATCAA GCTCACTGTT GCTGATTTCA ATACGGCCAC GTACGGCATC ATCAAATACC 3 07 800 

ACCGAG CCTT CAGTTGGATT TTTAAAACGT ACAACGTGGG GTTCATCAGG TGAATGGTTG 3 0 786 0 

TGATCGTGTA AGCAGTGACG GTCATAACGT GGTTT T TCTT TATTTTGTTC TTGAGTATGA 3 07920 

CGAAGTTCTT CTAAGTGTTC TTTAGTACAA TAACAACGAT ATGCTAGGCC TTGTTCAATC 3 07 980 

ATTTCATCGA TCACTTGGTT ATAACGATCA AAGCGTTTAG TTTGGTAATA AGGGCCATGC 3 08 04 0 

TCCCACGGAA GATTTAACCA TTCCATG CCT TCAATAATGG CAGCAGTGGC TTCAGGTGTT 3 08100 

GAACGCTCTA AATCAGTGTC TTCAATTCGT AATACAAATT CGCCGTTATT ATGTTTTGCG 3 0816 0 

TATAACCAAG AATAAAGTGC GGTACGTGCG CCACCTACAT GTAAATAACC TGTAGGGCTA 3 0822 0 

GGAGCAAAAC GAGTACGCAC TTTTACATTT GGAT CTAAAT T AAAAGG AG C ATCTAGTTTC 3 08280 

ATTTTTATAA CCTATCTTCA ATAAAATATT AGTCGTTATT CTACTACGAC TTGTTTGACT 3 08340 

TCTAAATAAA AAAGCGGATT TTATG CTTAT TTTTACGACA ATTAAACAAA ATTTAATAAA 3 08400 

AAGGTATTGA CTGAAAATGC ATTCTCCCTA TAATGCCAAC C C AC AAC ATT AAGGGCGATT 3 084 6 0 

AGCTCAGTTG GGAGAGCACC TCCCTTACAA GGAGGGGGTC ACTGGTTCGA GACCGGTATC 3 08 52 0 
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GCCCACCACT TCTCTAAAAA GTTTTCTTAA TTTTACTAAG TTTGACCAAG CGGG CG ATTA 308S80 
GCTCAGTTGG GAGAGCACCT C CCTTACAAG GAGGGGGTCA CTGGTTCGAG ACCGGTATCG 308640 
CCCACCACTT TTCTAAAGTG TCCTTTCCGC TCAAACTTTC CCTTTATATT TAAACGCTTT 308700 
CTAGTGGGCG ATTAGCTCAG TTGGGAGAGC ACCTCCCTTA CAAGGAGGGG GTCACTGGTT 308760 
CGAGACCGGT ATCGCCCACC ACTTAAAAGT GTTTAAATAA ACCGAAAATT TAGGGTTTTG 308820 
CCTTAATATT ATCGGGCGAT TAGCTCAGTT GGGAGAGCAC CTCCCTTACA AGGAGGGGGT 308880 
CACTGGTTCG AGACCGGTAT CGCCCACCAC TTTATTTTCA AATTATCCAA ATAATTTCCT 3 08 94 0 
TTTCCCGTTC AATCTATTAC GTTTACGTGA TAAAGTGCGC AGGCATTGTC ATATCTAATT 309000 
TTAAACTGTT TATATATCAT AGGGAAAATT TATGATCGCT TATATTTTCT TAGCTTTATT 309060 
CACGATTGCG GCAGTGATTT TTATCGTAAA CTCCCATTAT CGTTGGACTT ATTTCTTTGC 309X20 
TATTACGCTT TTTACTTTTT TATTTGGTGG CATGCTTATG GTTTCCAGCC AATGGCAACG 309180 
TGCTTTAAAT TTTTCATCTG TGCTTTTTGT GGTACTGATG TTATTTCACC GTTTAAAAAT 309240 
TCATTATTAC AAACAACCTT TGTTAATTTC AGATTTCTTT CTTGTGGTAG ATTGGCGAAA 309300 
TTGGGAAACC TTAATTCATT ATAAAGGTGC ATTGTTTGGC GTTATTGGGC TACTGGCGTT 309360 
ATTAGGTTAT G CGATTTTkG GTTTTAATGA TGTCGAGTCT TTAGGGGTAT TGGGTAACAG 309420 
CATTGGTGCT TTATTGTTTA TCGTTAGCTT TAGCTTGATG TGGCATTATT CTAAAAATCC 309480 
AAGTGCGGTG CAAGTTTGGT TAGATTCTTT GCCTGATGAT GGTCGTGATG TGTTTTTAAA 309540 
TTTACCAATG TCTTGTCGCG G TATTTTTTT CAAAGTGCCA AATTTTGATG GCAATAGCCA 309600 
AAATTTTATT GAAAAAATGA CCGCACTTTC ATCTGATGCA AACAACCTAT CTGAAACAAA 309660 
AC CCG AT ATT GTTGTAACAT TAATGGAATC CACGCTGAAC CCTCATCAAT TTGCTTTTAG 309720 
TCAGCAATCT ATTCCACCAC TTTCCATGTT TGAACCCCAA AATGATACAG TATTTG CTTC 309780 
GCCGTTGCGT GTACATACTT TTGCTGG CGC AACATGGAAA TCAGAATTTG CTTTTTTAG C 309840 
CGGTGTGCCT TCAACCGATT TTGGGGCGTT AGcAAGTGGA GTTTTTTATT CTGTTGTACC 309900 
GCACTTGCAA TCAGGTTTAG TGAAAAATCT AAAAGCTCAA GGATATTTTT GTGTTGCCTT 309960 
GTCGCCTTTT ACAAAAGGTA ATTACAATGC AAAATCCGCT TATGATCATT TCGGGTTTGA 310020 
TTTGATGTTA CAGCCGCAAG ATCTTGGTTA TCCTGCACCT ATCAGTAAAA ATCTTTGGGA 310080 
TATTAGTAGT GAAGAAATGA TGAAATACAC CCGAATGATT TTGGAAAAAC AGCATCCAGC 310140 
GTTAGAAAAT GTAGATCAGC CAATGTTTGT TTATGTATTG ACTATGCGTG AACATGGtCC 310200 
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ATACGAATTA GGCATGGaAA ATACATTTAA TCnTCAAATG CCTnATTTAG GAGCnAAAAG 310260 
TAnnTCTGCA TTGAATGACT ATACTCAACG TATTGTTGCA TTGAATGATG CGATTGAAGG 310320 
AATAAATAAC TATTTACATG AACGTAAAAA GCCTTTTGTT CTAGGTTATT TCGGTGATCA 310380 
CCAAGTGGCA TTTGATAATG CGATCCCTCC GAAGAAAGGG GATTATGCAC AGCCTGATTA 310440 
TGTGACACAA TTTGTTGTAA GAAGTAATTG TGCAAGTCAA TTTAAGCAAG AACAAAAGTT 310500 
TTTAGATCTT GCTTTTGCTG GAGGTGTTTT GATGAATGTA GCTGGATTAT CAGCAGAAGA 310560 
TGAATTTATG AAAGCGAATA TGGCAATGTG TAAATTGAGT GATGGCAAAT TAGAAGATTC 310620 
TtCTGATATT CAGCTACTCA ATAATTATCG CCATTATTTA TACCAAACAT TGGCGATTGC 310680 
CAGATAATTT TCTTATnGAT CGAGCATTAT TGCTCGATCn GCTCGCCAAT TTCATCCAAG 310740 
GTTGAAGCTA ATTGCGCACC AACTAGCACC ACAAGCCAAC TTATTTGAaT CCACAATAAC 310800 
ATAATTGGTA AAGTTGCCAT TGCACCGTAA ATTAATTGAT AAGAGGGAAA AGTGACGATA 310860 
TACCAAGTAA ATGCTTGTTT ACCAAGAGTG AAAAAAATCG CGG CAAG AAA TGCACCGTAA 310920 
GCTGAATGCT TAATTTTCAC TTTTTTATTC GGTACAACCG TATAAATAAG CGTAAAAATA 310980 
AACCAAGTAA ATAAAAAAGG CACAAAACTG AGCAATTTTA AGC CTAATG A TAAATGTTCC 311040 
GATTGCTCAA ACATTATTTT AATATACGAA CTAATCCCAA TGCTCACGCC AATAATAAGT 311100 
GGGCCAAGTG TCAAAATCAT CCAATAAATC GCAAAAGAAG AAAGCGGAGA GCGAGATTGA 311160 
CTATTGTGCC AAATACTATT TAGTGTTCGG TCGATGTTAT TAATTAGCAT CAAGGCAACG 311220 
GCAATTAAAC TCACAATTCC CACCGCACTC ATTTTCTTTG AGTTTGATAC AAATTGATCG 311280 
ATATATTCGC CCACCATATC ACTCGCTGAT GGCGcAAAAT TAGTnAAAAT CATTTCTTTT 31134 0 
AATTCTCCCG TTACTTCATT GAATACAGGA AAAGCAGAGA AAATAGAAAA AATAACCATA 3114 00 
ACTAAAGGAA CCATTGCTAA CATTGTGCTA TAAGTCAATG CGCCAGCTAC TTGATTAAGT 311460 
TTATTTTCTG AAAAACGTTT CCAAAATAAT AATCCAAAAT TTTTTAATGA AATCATAGAA 311520 
ATCCTCCACA ACAATGTTTA AATTTTTTAC CCGAACCACA GACACAAGGT TGTTTCATTG 311580 
TCGGTAATGA TACGGTCGGA TCCACAAAAT AC CAACGATC TTCAATTTTC ACAAAAATTG 31164 0 
ACCGTTCTTG ATGGGCTTGC TCACATTCTT CGCCTTGAAA AATCGCCTTA AATTCCACCG 311700 
CACTTTGAGT TTTTGTAAGG CTTTCCGTTT TTAGAATTTC TAATCCTAGC CACGTAGTGT 311760 
TGTCAGCCCA TTCTTGCAAT AACCTAGGCT TTAATAAAGT TTGTTGGCTT GGCACAGTTG 311820 
TTACAACAAT ATAGGGAATA TTTTTTAACA CATAAGCCGT ATAACGTGAA CGCATAAGTT 311880 

-77.187- 

SUBSTITUTE SHEET (RULE 26) 



WO 96/33276 PCT/US96/05320 

GCTCTGCCGT TTCGGGAAAT GCTTGTCTCA AATGAAATTT GCCACAACAA TCTGCGTAAT 31194 0 

GATGGCTAGA TTGACATGGA CAATTTTCTA AAGAAAGCGC GGTTGAAATT TCTGACATAT 312 000 

TTATCCCTGT TGTTTGAATT TCGATATTTG CGATCGAATT GAGCGTTTAA AGGCTTCAGC 312060 

TAATACTTCA ATCTCTAATG CTGAATCTAA TTCAGCTTCA TTTTTTGGCG TAGATAACAA 312120 

GCGATTTAAA TGACGAATTG TTATTTGCGG ATAAGGACGG GCGAGTAATT TTAATTCATC 31218 0 

CTTTTGCTTA ATTGTTCCTG TTTCAATAAT ACGAACATAC CATCCACTGA GCCCAGTTTG 3122 4 0 

ATAAACAATA TCGCGAGTCT CTTCGTTTTC CGTGTTTTTT GATAAACGCT CACAAGGTTT 3123 00 

ACGTGGTTGA GATACTTCTA AAATGGTTGA GCCGATTTGA TAACG ATCG C CAATACATAC 312360 

ATTGTCTTCA TTCAAGCCTG ACACGACAAA ATTTTCTCCG TATGTCGCAG TATCCATATA 3124 2 0 

AGGAAAAATC TTTATTTAAT AAAGCATTTA ATTCATTAAA AGAATCTGCC GACATAAAAA 3124 80 

AG AG AG C TTT ATCTACACCA CCGTGATGTT TTTTTAAACC AACATCGTTT CCTTCCGCAC 312 54 0 

CAAGAGAATG AATTTTTACC GATGGCACAA CTTTTTTACG AATCGCGCTT TCATACTGGC 312600 

TTCCATCGGA AAAGGTTAGG GTTTCAACTT GCCCCACTTT TATGACTAAA ATTTTTG CAT 312660 

TCATTTTATT TTTCGCTTGT TCTAGAAAAA ATTTACGTGG AATTATAACC GAAAACGTTT 312 72 0 

AACTTTTTGA CCGCACTTCT TTTTTTACTT ATTATTCGTT ATCAGAAAAA CAAAAAAGAA 312780 

ATAATGAAAA TGTTTGAAAG AGAAAAAAAT TTTCTAAAAA TATTTTATTT CTTTTCTTGA 312 84 0 

AAATAAGATT TTTCCTAAGG GCGAAATCAT CACGTTCAAT GATAATAAGG AGATTTATTA 312 900 

TGTCAGGCGT ATTCCACCTA AACCTCACTA AAGCACAACT TAAAGGTGCA ACACTCGCTA 312 960 

TTGTTCCTGG CGATCCTGCG CGTAGTGAGC GTATTGCGAA ACAACTTGAT AATCCAGAGT 313 02 0 

TTCTTACAAG CACGCGTGAA TTTACATCGT GGTTAGGTTA TATCAATGGT CAGCCTATTG 31308 0 

TTGTATGTTC TACGGGTATT GGTGG CCCTT CAACATCAAT TTGTGTGGAA GAACTTGCTC 31314 0 

AACTTGGTGT TCGCACTTTC TTACGTATTG GCACAACGGG TGCAATTCAA CCGCACATTA 31320 0 

ATGTAGGCGA TGTTCTTATC ACAACTGCTG CTGTCCGCTT AGATGGTGCA AGCCATCATT 31326 0 

TTGTACCATT AGAATATCCA GCTGTGGCAA ACTTTGAATG TACAACCGCA CTTTATAATG 313320 

CCGCGAAAGC AAAAGGTATT GAGCCTTATG TAGGTGTAAC CGTATCATCT GATACATTCT 313380 

ATCCAGGTCA AGAACGTTAC GATACTTATA GTGGTAAAGT ATATCGTAAT TACCAAGGTT 31344 0 

TGCTTAAACA ATGGCAAGAT TTGAATGTCA TGAATTACGA AATGGAATCT TCAACATTGT 313 50 0 

TTACTATGTG TAGCG CATTA GGTTTGCGTG CAGGAATGGT TGCAGGCGTG ATTGTAAACC 31356 0 
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GCACTCAACA AGAAATTCCA AATGAAGCAA CGATAAAACA AACAGAAGAA AAAGCTGTGT 313620 
CTGTTGTGAT CACAGCTGCA CAAGCATTGT TAAGTTAAAT TAATATTTAT TTGAAACAAT 313680 
AAAACTTTAT TTTATTCTGC ACCTTAATTA GATTTATTTT CTAACGAAGT AAGGTG CAG A 313740 
TTTTTTATTA ATAAACTGCG ATCTTCTGAT CAATTTAATC AAAGTAGTCT ATTTACTTGT 313800 
TGGCATTGAA ACCTAAACTT GATATAAGTT CTAAATAAAC TACTTCTAAA GGCTAAATAA 3l 3860 
CCTAAAATAT TTACGACTAA AATTAGATTT CTGTTAAATC TACATTTTTA GTCTCTTTTG 313920 
AGGCTAATAA TGCCAAGAAA GTCATTAATG CATTGATTGC TAAATAAATT CCTACACCCA 3139 80 
TTACACCGAA TGACGCGTTA ATTTTTAAAG AAATCATTGC TGCAATTGTT GCACCAATAA 314040 
TTGATGCTAA ATTATACG CA AGAGAGGCAC CTGAATAACG GACTTCTGTT GGGAATAATT 314100 
CTGGTAATAG CGCAGCCATT GGAC CAAATG TCATTCCCAT AATCG C CAT A CCAATCACTA 314160 
AAAATGCAAA TACGCTAACT GGAGTGCCAT TTTCTAAGAA TAACGGCATT GCTAAACCAA 314220 
GCACGCCGAT AGCAATAGTC ACCCAAATCA ACCATTTACG ACGACCGATT TTATCGGCAT 314280 
AAAAACCGGA AATACTGATA AAAATACCAA ATACAATGGC ACTAATTAAT AACAAGCCAG 314340 
TAAATGTATT TGCTGGGATT GCTAAACCAA GTGCATAACC TGCTTCAGAA AGCTTTGGTG 314400 
CTGTTCGAGA ATATGCCTGC GCAAAAGCCG TCATAATGTA AAACAGAGAA TAAGTTGCCA 314460 
CCATAATAAA TGTG CCAATA ACCATTGGTT TTAAGTGTTT GGTAAACACA ACACTCACTG 314520 
GTGCATTCAA TTTTTTG C CT TTTTGCTCTG CTTCCACAAA TACGTGGCTT TCGTGCAAAG 314580 
TTAAGCGAAC ATATAAACCA ACTGCAACTA ATAAAATGGA AGATACAAAT GGAATACGCC 314640 
ACGCCCATTC AACAAGTGCA TTATGACCAA GCAAATAGCT AACTAAGAAG AACGTTCCAT 314700 
TAGCGACAAA TAAACCAATT GGTGCGCCTA ATTGAGGGAA AGTACCGTAC C AAG CTCGTT 314760 
TTCCTTCTGG TGCATTTTCT GTTGCGACTA ACGCTGCACC GCCCCATTCG CCCCCAAGCC 314820 
CAATACCTTG ACCAACACGG CATACGCAAA GCAAAATCGG CGCCCAGATA CCAATTTGAG 314880 
CATAATTAGG AAGTAAACCA ATTACTACCG TTGAACCACC CATTAAAACA AGAGAGGCAA 314 94 0 
CCAAGGTTTT TTTACGACCA ATTTTATCGC CAAAGTGACC GAATAATGCC GAACCAATAG 315000 
GACGTGCAAA GAATGCTAAA GCAAGCGTAG AAAGAGAAAG TAAATCATTA GAAAGCGGAT 315060 
CATCGCTATG AAAGAATTGC GTATTGAAAA CTAATACAGC GGCAGCCGCA TAGATATAAT 315120 
AATCAAAAAA TTCGATTGCT GTACCGACCA TCGAGGCTAA CGCCACTTTC ATCGGATTAT 315180 
TACGAAGTTG TGTAGACATT GTGTTTCCTT AGAAGTTTAT TATAAAGAGA AAGGATTGAA 315240 
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TCCTAGCATT TTAACTAAGG TCTAACAATT TTTAATATGC AATTTT CTAT TTTTTTAGAG 315300 
AAATATTTTG AATTTTCTAA ATGTGAGTTA TGTCCAGCGC AGGGAATTGT CACTAAATCA 315360 
ATTTGATTTT CTTTGGCTAA TACTTGGAAC TTATGATCTC TTTCGCCACA AAAATAAAAA 315420 
AATGG CAAAG AACTTAACCG CACTTTTTCA CTAAAATCAG GTTGTTTGGA TAGACTTGTC 315480 
GCCAGCAACA TTTTG CCAAT ATTTTTCCCA CAATTTGATT TTCGTTTTTC AACCAACTGT 315540 
AATCTTTCTT CTGTAGTCAA ATGAGAAAAC ACAGGTTGTT GATACCAATC ATTCAACACT 315600 
TTTTCTGGCG ATTCTTG CAT AAAACGTTGC GCCCAAGCAA AATCCTGCTG AAAACGAGCC 315660 
TGCTTTTCTT CGTCAGTTTT TAAACCTAGA TTCGCACCTT CTAAAATAAC GCCTTGCAAA 315720 
TTAGATCGTT CCAGCTGAGC TTGCAGCGCA TAATACAAAG CAATCCTGCC ACCTAGAGAA 315780 
TACCCAACAA GAAAATAGGG TCCATTTTTC ACCGCACTTT TAATTTGCTG AGCCAAATAC 315840 
TCCGCCGAAT CTTCAAAATT TGTGACTTCA AGGTCTTTAG CTTGCCCGTG AAAAGGCAAA 315900 
TCTAGCGCAA TACAATTAAA GTGCGGTAGA TTTTCAATGA CTTTTTGCCA GTCATTTTTT 315960 
GTACCAAGAA GTCCGTGAAG AAAAATGATG TTTATCATTG AAGGGATTAT TGATAGAAAT 316020 
AATAAAATAT GGCGGACTAA AGCCCGCCTA CAGAATACTT AAGCACCAAT CACGGCGTGA 316080 
CTAATTTG CT CGATTAAGCG TTTATACAAA CTGCTACCAT CACTCGGATT CGTTTTAATT 31614 0 
TCAATTAACG TTGCCTTGCG ACGACTATAA GCCTGTTTCA CAACAGAATT GAGATCTGCC 316200 
CAAGTATAAG GATGAGCGTA TTTAAGATCG AACATTGCCG CGATTTGAGA AAAATCGCCA 316260 
TTATGCGGTA ATCGATAGAA CTGATCTTTG ACTTGTTCAT CCACAGGTAA CATATCAAAA 316320 
ATCGCGCCAC CATTATTATT GATCACAAAG ATTAGCGTTG GTTGCGTAAC GTTTTTGAAT 316380 
AATGCGAAAG AATTCAAATC ATATAAAGTG GACGTATCGC CAATCACCGC AACAACAGGT 315440 
TTATTTGAAC CAATACCTAT TCCAGCAGCA GTAGCCAACA AACCATCAAT TCCACTCGCA 316500 
CCACGATTGG TATAAACGGG ATAACTTTCT GGTAATTGCG TTAGCGCATC AACTAGACGA 316560 
ACAAGTAAAC TATTACCTAA AAATAAAACA CCGTTATAAG GTAAAAGTGT TGGTAAACGT 316620 
AATGCGAGCG AGGCTTCCGT TAAATTTCCG CCTACTTGCT GTTCAATAAA GGTCGCACAG 316680 
AACTTAGATA AAGCTAACGG CTCAAGCAAC CAAGGCTTTT GGCGTAAAGG TGGATGTGCA 316740 
CGCAACCAAT GATGTACTTT AGCATTAAAT CTTGTCAATG AATGATGGTA AGGATCTAAT 316800 
GCTTTACCGC TTTGTTCAAC TAGCCAAAAT TCCCCTTTAA ACGCTTGCAA GAATTGATTA 316860 
ATGCGTTTAC TAATAAAACG CGCGCCAAAT TGAATCACAA TATC AG CTTG TAGAAGTTTT 316920 
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TCACGAACAG TTTGGTTTGC CAGCCAAATA TCTTCATAAG GCGTCGTTGG AACAACACCA 316980 

GATTGAATAT CCGTCAGTAA CACCCAGCCC ATAGCACTTG CCCAAGAATT AATGCCCATC 317040 

GCTTGTTCTG CAGGTAATTG ACCAACCACA ACGACACCAC GTTTGGTACG CCAATGATCC 317100 

CAATTTTCAT GCATTAATAC TTCGTTTTGT TGCGCTTCTA CATTTATCCA AGACTTGTTT 317160 

TGAATTAACC AGCGTTGTAA CGGCTGTAGC CAACTATGGG AATTTACTTC CTCATCAGTT 317220 

G CAT CAT AAA GTGGCTCGGC AAATGGGACA TTAATATGAA CAACGCCACC TTGTTGTTTT 317280 

TGTTGGAAAA CTGCCTGTTC AAGTAGAGAA ATTAACCACT GCGCAGAATA ATCAGCATTC 317 340 

GGTTTAGGTA AATTAACATT TGCAACTGGA TACTGACCAA ACATATTTTG TTGCAAAATG 3174 00 

GCCTGATTTG CACCGCACTC CCAAAGTTCT GGTGGACGAT CAGCAGTTAA AACAAATAAA 317460 

TTTACACCAG TTTGGCGTGC TTCAATAATG G C AGG AT AAA GGTTTGCTGT TGCAGTACCT 317520 

GATGTAACAA TAATTGCAAC AGGCGATTGA GTTGCCTTAG CAATACCCAA AGCAAAAAAA 317580 

CCTAAACCGC GTTCATCAAA ATGCGTATGA CAAGTGACTG CGCCCGCATT TTGCAAACGT 31764 0 

ACGG CTTCTA GAGTTAAAGG TGTCGAACGC GAACCTGGCG CGATACAAAG GTGAGAAACG 317700 

CCTTGGCGCA CCAAGGTTTC TAAAATCACT TTCGACCAAC AACGATTAAA CACGCTTACC 317760 

GACATTTTTT TTCTCCGTTA TCATTTTTCT TCTGCAAACA AGGAAATTAA CCCTGCTGCT 317820 

TTACGTTCAA TTTCTTTCCA TTCTTCCAAT GGCTGCGAGC CTGCCACAAT GCCCGCGCCA 317880 

G CAAATACTC GAATACGATG ACCTTCAATA AAAGCAGAAC GAATCGCCAC ACAAAACTCT 31794 0 

GAACATACAT CGCTCATCAC GCCCAATGTT CCCGCATACC AGCCTCGATC AAAGGTTTCG 318000 

ATTTCTGACA AAATCATTTT GGCTTGTTGC TGTGGCAAAC CAGATACTGC AGCTGTAGGA 318060 

TGAATCGCTT TTAATATATT TACATCTGCA TAATGTG CGG TCAAATTTGC ACGAATTTTA 31812 0 

CGAATCAAAT GTTGTACTTT ACGTAACGGT TTCAATTCCA CATTGCTTAC ATCAAAACTT 318180 

TCTACTTGCT TACGTAAATT CTGCGAAATA TCTTCTACAA CCAACCAATT TTCTTTTAAA 31824 0 

TTTTTCTCAT CATTTAACAA CCAATTGGCT TGAGATTGCG TTTCTTCCTC ACTTTCAGAA 3133 00 

ACCGATGCTG TCCCCGCCAA AGCTTCGGTT AG C AACAAAT TATACTCACG AGCAAATAGG 3183 6 0 

CGTTCTGGCG TAGAACCAAC AAAAACCGAA TGAGAATTTT CAGCCCATAA AAAATGATAA 3184 2 0 

CAACCTTGAT TTTGTTTTTC GCTTTCTGCT AAAAAATCGT ACGCATTAAT CGCTTGCTTT 318480 

AAATGAAAAG TGGTTTCATT GGCTAACACA ATTTTTGTCA GCTCTCCACT TTTAATTTCT 318540 

ACCAAAGCCT GATTTACCCA ATCACACCAA GTTCTTTCAT TGGCTCGAAG CTCTGTATGC 318600 

-77.191- 

SUBSTTTUTE SHEET (RULE 26) 



WO 96/33276 PCT/US96/05320 

AAAGGAATTT GTTTTGGCAA AGCAGAAAGT GCGGTGAGAT TTTCAAAAGT TTTGAGGTGG 318660 
GCTAAAGTAT CATTCGCACT TTG CTCATTT TTCACAAAAA AGGATACCAA CGTGC CTTT A 31 8720 
TTATC CTGTT CAACAAGCAT TTTAGGTAAT ACAAATTGTG CAGTGCCTTG AAATTGCAAA 313780 
CCGCCGACCA GTGGAAAACC ACTTTCTTCA ATAAACTCTT GCGCTAAATT TAACTGTGAA 318840 
AATGACTGCA CTGCCCCAAG CGCAGCCAAT GCTTTTTCTT CATCACGAAA ATGTAAATAA 318900 
AATTGAGGAT ACGCAGATTG ACCTTTCAGC CACGCCAACA AATCGACCTT GTCCAGTTTT 318960 
ACCTGAAAAC GGACAAGTTC ATTCGTTGAT TGCTGTAAAT ATGCGTGAAT TTGCGATTTT 319020 
AACTCGCCTA TTGCCTGCGC TAAATAACTC ATTAGATTTT GTTTTAAATA CACAAAAAAT 31908 0 
TAGCCATATT CTACCGCAGA TTGTTAATGA CAAAAAATTT TTTTACAAAA TCTTTTCAAT 319l 40 
ATTGAAAATT CAGCGTAAAT TAACCGCACT TTTCTCGAAA AATTTATTAG AAACAAAAGG 319200 
aTTTCACTAT GCGATTATTC CCAAAATCCG ACAAATTAGA ACACGTATGT TATGACATTC 319260 
GCGGACCTGT G CATAAAG AA GCACTACGCT TAGAAGAAGA AGGCAATAAA ATTTTAAAAC 319320 
TTAATATCGG CAATCCCGCA CCTTTCGGCT TTG AAGC AC C AGATGAAATT TTGGTGGACG 319380 
TATTGCGTAA TTTGCCGTCA GCTCAAGGCT ACTGCGATTC AAAAGGATTA TATTCTGCTC 319440 
GTAAAGCCAT TGTTCAATAT TACCAATCAA AAGGGATTCT CGGCGCGACC GTCAATGATG 319500 
TGTATATTGG TAACGGCGTA TCTGAGCTTA TCACAATGGC AATGCAAGCA TTGCTCAATG 319560 
ATGGCGATGA AGTGCTTGTG CCGATGCCTG ATTATCCGCT TTGGACTGCT GCGGTAACCC 319620 
TTTCTGGCGG TAAAGCAGTG CATTATCTCT GTGATGAAGA CGCAAATTGG TTTCCCACTA 319680 
TTGATGATAT TAAAGCAAAG GTGAATGCGA AAACTAAAGC GATTGTCATC ATCAACCCGA 319740 
ACAACCCAAC GGGTGCTGTG TATAGCAAAG AATTATTACA AGAAATTGTG GAAATCGCGC 319800 
GTCAAAATAA TTTAATTATC TTCGCTGACG AAATCTACGA CAAAATTTTA TATGATGGCG 319860 
CGGTGCATCA TCACATTGCC GCACTTGCTC CTGATTTATT AACCGTTACA TTGAATGGGT 319920 
TATCAAAAGC TTATCGAGTT GCAGCTTCCG CCAAGGCTGG ATGATTTTGA ACGG C C C AAA 319980 
ACATAATGCG AAAGGTTATA TTGAAGGCTT GGATATGCTG GCATCAATGC GTTTATGTG C 320040 
CAACGTACCA ATGCAACATG CAATTCAAAC CGCACTTGGT GG CTATCAAA GTATTAATGA 320100 
ATTTATTTTA CCAGGCGGTC GATTACTTGA GCAACGAAAC AAAGCCTATG ATCTCATCAC 320160 
TCAAATTCCA GGCATTACTT GCGTGAAACC AATGGGGGCG ATGTATATGT TCCCGAAAAT 320220 
TGATGTGAAA AAATTCAATA TTCACAGTGA TGAAAAAATG GTGCTGGATT TACTCCGCCA 3 2 0280 
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AGAAAAAGTA CTACTTGTGC ACGGTAAAGG ATTTAATTGG CATTCACCAG ATCACTTCCG 320340 
TATTGTTACT CTTCCTTATG TGAATCAGCT TGAAGAAG CC ATTACAAAAT TAGCAAGATT 320400 
TTTGTCGGAT TACCGTCAAT AATGAATGCT TTGACAACAT TACAAAGGGG ACTAAATGTC 320460 
CCCTTTTTAT CGATATAAAT ATATTGCACT TTCCTTGTGT ACTACTATAA TAGTGCAAAT 320520 
TTATTTTAGA GGCATTTTAT GATACAACAA AAATCTCCTT CTCTCTTAGG CGGTGCAATG 320580 
ATCATTGCAG GTACAGCGAT TGGTGCAGG C ATGCTGGCTA ACCCAACTTC AACGGCAGGT 320640 
GTATGGTTTA TCGGTTCAAT TCTTGCTTTA ATTTATACTT GGTTTTGTAT GACAACCTCA 320700 
GGATTAATGA TCCTAGAAGC TAATCTACAT TATCCAACAG GTTCAAGTTT TGACACTATC 320760 
GTTAAAGATT TATTAGGAAA AAGTTGGAAT ACTATCAATG GTCTTTCTGT TGCTTTCGTC 320820 
TTGTATATTT TGACTTATGC CTACATAACC TCAGGCGGAG GCATTACACA GAATCTGCTC 320880 
AACCAAGCAT TCAGTTCTGC TGAAAGTGCG GTAGATATTG GGCGAACTTC AGGATCTTTA 32 0 940 
ATTTTCTGCC TTATTCTTGC AGCCTTTGTA TGGCTTTCAA CCAAAGCAGT GGATCGTTTC 321000 
ACAACCGTGT TAATTGTCGG GATGGTGGTT GCTTTTTTCC TTTCTACTAC TGGTTTATTA 321060 
AG CTCTGTAA AAACAGCAGT TTTATTCAAT ACCGTTGCTG AAAGTGAGCA AACATATTTA 321120 
CCTTATTTAT TGACCGCACT TCCAGTTTGT CTCGTGTCGT TTGGTTTCCA CGGAAATGTC 321180 
CCGAGTCTCG TCAAATATTA CGATCGTGAT GGTCGTCGCG TGATGAAATC GATCTTTATT 321240 
GGTACAGGCT TAGCATTAGT GATTTACATC TTATGGCAGC TCGCTGTACA AGGTAATTTA 321300 
CCACGCACTG AATTTGCGCC AGTCATTGAA AAAGGCGGCG ATGTTTCTGC ATTATTAGAG 321360 
GCTTTACATA AATATATTGA AGTGGAATAC CTTTCCGTCG CATTAAATTT CTTTGCTTAT 321420 
ATGGCAATTT CTACTTCATT CTTAGGCGTC ACTTTAGGGT TATTTGACTA TATTGCGGAT 321480 
CTATTTAAAT TTGACGATAG CTTATTAGG C AGAACAAAAA CAACACTTGT TACTTTTTTA 321540 
CCGCCATTAT TGCTAAGCCT GCAATTTCCT TATGGATTTG TG ATTG C CAT TGGTTACGCT 321600 
GGATTGGCGG CAACAATTTG GGCTGCAATT GTCCCCGCAC TTCTTGCCAA AGCAAGCCGT 321660 
CAAAAATTCC CACAAGCAAG TTATAAAGTA TACGGCGGAA ATTTTATGAT TGGTTTTGTA 321720 
ATTTTATTTG G CAT ATT AAA TATTGTGGCG CAAATTGGGG CAAACTTAGG TTGGTTTCCA 321780 
AGTTTTACTG GGTAAGTTTT AAAAAATAAA TTTAAATTAT TAAATAGCGA TCTGCACTTT 321840 
AACAGTGGGT TATGTAGTCC AACTAATTCG TGTAGATCAG CCCCTTAATT ATTTTACTTT 321900 
TTATTAGTTA CAAACAATCT TAACTGCAAG TCCGCCTTGA GATGTTTCGC GATACTTGGC 321960 
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AGTAATCGCT AAGAATGCAA CAAG CGGGC C AAAATAAGAG ATATATGGAT TATCAAATTT 323700 
ATTCGCTAAG AAAGACAAGA TACTAATATT CTGTTCTTTT GCTGCCACTA ATTCTTCAGG 323760 
TGTTAATGTT AATACACAGC TAAATACGAA GAACATTACG AAGAATAACA AAATCGTTGA 323 820 
GGCGCCTTTT TCTGTATGGC TGATATGGCG TTCTGCACCA TCGAAAGTTT TATATTCACG 323880 
GAATTGTGAG CAAGTGAAAG ATGAAATTGC TGGCGAATGG TTAAATGAAA ATACTAATAC 323940 
GGGAATGGTT AGC CATAAAG TTGTTAAGAA ACCGCCTACT GTTGGAAATT CATATAATAC 324000 
GGCACTATTC CATTCAGGAA TTAAGTAAAT AGAAAG r AC A AATAAAATTA AGACTAACGG 324060 
ATAAACAAGC CATTCTGTGA TTTTTAACAT CACTTTTTCA TTAAATAACA TCACGGAAAT 324120 
CAATACCGCA ATTAATACAA AAGAGAGAAT GACTCGATTT GGTGATGCCA TACCAAGTTG 324180 
ATTCACAATA AAAGAATCTA CGGTGTTCGT GATACCGTTA CCGTAAATCA ACAGAATAGG 324240 
GAAAATCGCG AAAAAATAAA GTAAGGTAAT AAGTTTACCC GCTGTTTTAC CAAAGTGTTC 324300 
TTCCACTACT TCAGTAATAT CGCTGCCAGG ATTTTTTGAT GAAAGTACAA AGTAGGCTAA 324360 
GCCACGGTGA GCAAAATAAG TCATTGGCCC GACGATAATT GCCATTAAAA CTAACGGCCA 324420 
AAATCCACCC ATCCCTGCGT TAATCGGCAA GAACAATACG CCCGCGCCAA CCGCCGTACC 324480 
GAAAAGGTTT AACATCCACG TTGCATCAAA TTTGTTCCAT TTGAGTTTTT CTGTTGATTT 324540 
CATAGTTTGC TCCAAAGCAT AAATTACTTA AATTTGTCAA TTTGTAGAGC AAAATCCGCT 324600 
CCAACTAAAT TTGAGTATTA TGTTATGCCA ATAAATAGAG AATCTCAAAG AGCAAATTCT 324660 
AAAATTGTGA TCAAGTTAAA ATTTTTTTAT AAAAAGTGCG GTAATTTTCA GCGACAAAAA 324720 
AATCGGTGCG ATGACGACAC CGATTGCGTA TAAAAATAAA TGAAATTAGA ACCGCACTTT 324780 
TTTCAATCTT AAGGCGTTCA TCAATACGGA AATTGAGCTC AACGCCATAG CAGCACCAGC 324840 
GATTATAGGG CTAAGAAAAC CGAAAGCGGC AAGGGGTATG CCGAGAATAT TGTAAATCAA 324900 
AGCAAAAAAT AGATTTTGTT TGATATTTTT CAAGGTTGCT CTTGCAATAA AAAGCGCATC 324960 
C AC AAG TTG A TTCACCGAAT GTTGCATAAG CGTTGCAGAG GCTGTTTGTT CTGCAATATC 325020 
TGAACTTGAT TTCATCGCAA AACTGACGTT TGCAGAAGCT AAAGCAGGCG CATCATTTAT 325080 
TCCATCGCCT ACCATTGCGA CAATATGACC TAGATCTTTC AGCTTTTGGA TTTGTTCCGC 325140 
TTTATCTCTT GGAGTTAGTT TCCCAAAGGC TTTCTTAATG CCGAGTTGTT TCGCTATGTA 325200 
ATCCACGACA GATTGTTGAT CGCCACTCAT AATTACGACA TCAATATTTT GCTGTTGTAA 3 25260 
GCGTTGAATG GCGTGCAAGC TATCGTTTTT TAAAGTATCC GTCAGTGCAA ATGCACCGAT 325320 
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GGGTTCATCA TTTATGGATA CTGCGACAAT GCTCGCAATT TGCCAAATAT CCTCTAAATT 32S380 
TTTAGGCAAG ATTAAACCGC AATAATCAGG TTTACCGACT TTTATTGTTC CCACTTGTTC 325440 
TAATTCGGCT TGAATGC CTT GACCCACTTC CATCTTTGAA AAAAGTGCGG TGGGAATTTC 323500 
TAACATTTTT nGTTCTGCTG CTTGCACAAT AGCTTTTGCA ATTGGGTGGT TAGCTTGTCG 325560 
TTCAACTGCT GCTGCAAAAC GATACAAATC GTCTTCAGAA TACACCGCAC TTTGTGGCTG 325620 
CCATAAAGCA GAGATTTCAA GCTCTCCTTT TGTTAATGTG CCAGTTTTAT CAAGTACAAC 325680 
GGTATC CACG TGGGCGGTTT CCTCCATTGC AGCCGCATCT TTAAACCATA CGCCAGCATT 325740 
GACTGCTTTA CCTAAACCAA CCATTATGGC TGCTGGTGTG GCTAATCCCA AGGCACAAGG 325800 
GCAAGCAATC ACAAGCACGG ATACCGCGTG AATTAGTGAA GACACACTAT CATTCGTCAG 325860 
AATATAAGTG AGTGCGAAAG TAACAAGTGA AATCACTAAA ACGACAGGCA CAAACACGGA 325920 
AGTAACTTTG TCAGCAAATC GAGCAATGGG TGCTTTTGAA CCTTGTGCAT CTGATAATGC 325980 
ATTCATCATA TCGCCAAGTA AGGTTTGGCT ACCAAGCTGA TTTGCTCGAT AGATAATAGA 326040 
ACCTTCTGTT ACCATCGCCC CCGCTAATAC TTTTCCACCT TTCTGTTTTT CTTCTGGGCG 326100 
AGATTCCCCC GTTAAATGGC TTTCATCACA CCAACCATTT CCACTTTCAA TTACGCCATC 326a60 
TGCCGCAATA CGCTCGCCTT G ATTCG CACG AATGATTTCG CCGATATTGA CTTGGTCAAG 326220 
AGCAATTTCA ATCCATTTTT CATTACGTAA AACAGTGACT TTTTTCGGTG TCAGTTGCAA 326280 
GAG CATACTT AAGCTATTTA GGCTGTGCTT TTTAGTGCGA TCTTCAAGAA ATTTTC CAAG 326340 
ACTGACAAAA CCAATTACCA TAACAGAAGC TTCAAAGTAC ACATGTCCCA TTGCATGATT 326400 
GGCGTGGTAA AACAACATGA AAGCAGAATA AAGATAAATA GTGAGCGTTC CTGTGCTGAC 326460 
CAGTACATCC ATATTGGTAA GTCCACCACG AATACTACCG ATTGCTCCGC GATAAAATGG 32 6 52 0 
AATGGCTAAC CAAAGTTGCA CAATGCTCGC TAATGCAAAT TGCCAAATCG GCGGTAACAT 326580 
TAAGTTATGG GAGCCACCTA TCATACCAAG CATTCCAATC AGAAATGGAA TATTGATTAT 326640 
CCAAAGTACG ATTAATCGCC AAGGGATTGA TGTATTTTCT TCTATTGGTA ATTCGTTGGC 326700 
TTGTTTGATA TGGGCAGAGA AACCTGTTTT ATGGATAATC TCTATAATTT GAssTTCGCT 326760 
TGCTTGCGTA GCGTCAAATA CAACTTGTGC TTCTTCGGCA GCAAAATTTA CGCCTGCTTG 326820 
TTGCACAAAT GGTTTTTTAT TTAAAACTTT TTCAATGCGA TTGGCACAAG ATTGGCAGGT 326880 
CATCCCACCA ATCTGAATTG AAATTTTTTT ACTCTGTGGC GTCAAATCCA GCATCTTCAA 326940 
TGACCTCAAT CAACTGTGCA aCATTGACTC G ATTTTC AT C AAAAGTAATA TTTGCTTTAC 327000 
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CCTCTAATTG CACATCAGCA GATTGTACAC CATCTAATTC AGTTAAAACT TGAGTAAGAT 3 2 706 0 
TTTTTACGCA ACAGCCGCAA TGTATGCCTT TTATGTTTAA TGTGATAGTT TTCATTTTTA 327120 
CTCCTTGTTA GTTGTTTTAT TGATGAATGC GATTGGCAGG CCATCCCACC AATCTGAATT 3 2 7180 
GAAATTTTTT TACTCTGTGG CGTCAAATCC AGCATCTTCA ATGACCTCAA TCAGCTGTGC 32 7240 
AACATTGACT CGATTTTCAT CAAAAGTAAT ATTTGCTTTA CCCTCTAATT GCACATCAGC 3 2 7300 
AGATTGTACA CCATCTAATT CAGTTAAAAC TTGAGTAAGA CTTTTTACGC AACAGCCGCA 327360 
ATGTATGCCT TTTATGTTTA ATGTGATAGT TTTCATTTTT ACTCCTTGTT AGTTGTTTTA 327420 
TTGATGGTTA AACTATAAAC CTTTACATAA GGTTAAGGTC AAGAAGTTTT TTATGAATAT 327480 
TAGTGAAGCC GCAAAATTAG TAGGTTTATC TACAAAACAA ATCCGTGATT ATGAAAAAAT 32 754 0 
GGGGTTAATT AAACCCGCTG TTCGCAGTTT GTCAGGCTAT CGAAATTATG GAGAAAGTGA 32 7600 
TTTAGAAAGG TTGCATTTTA TTCGAcACTC ACGCAATGTT GGGTTTTCGT TACATCAAAT 3 27660 
CGCACAATTA TTAGCACTAC AAGATAATCC TAAGCGTAGC TGCCGAGAAG TGAAAGTGCT 327720 
AACTGCTCAA CATATTGCGA CATTAAATCA ACAAATAGAA CAGTTACAAA AAATGGTACA 32 7780 
AAAATTGCAG CACTGGCACG ACAGTTGTCA GGGCAATGAC AAC CCTGAAT GTTTAATTTT 32 784 0 
GAATGGCTTG AACGGATAAG TGAAAAGTGC GGTTAATATT CCCAGCTTTC TGGATCGACA 32 7900 
CCTAATTCAC GCATAAGCAC TTTGGCATCT TCAGGGATTT CATCATTACG CTCTTTCATT 327960 
AAATCTGCAT CGGTGGGTAA AGGTTGTCCT GTGAATGCAT GCAAGAACGC TTCACAAAGC 32 3020 
AATTCACTGT TAGTTGCGTG ACGTAAACTT TTTAATTGGC GGCGAGTGCG TTCATTCGTC 328080 
AAAATTTCCA GTACTTTAAT GGGAATGGAT ACTGTAATTT TTTTTACTTG CTCACTTTTT 32 814 0 
TTGCCATGTT CTGCATAAGG GCTGATATAT TTTCCGTCCC AATTCGC CAT AATTTCCCTT 3 2 8200 
AAGTG CTTAT TGATAACCTT ACAAATTGTC CGTATTCTAA TTAAAAATCA AATAAATAAC 328260 
AATCTAGACG GCTAAACTTC CAAAATAAGT TCTTAAAAAA AGTGCGATGA TTTTTATATT 3 2 8320 
AAATGCTAAT AAAAGCTAGA CAGCGTAGGG TTTTAGTGGT AATTTACGTG CCATTCAAAG 32 8380 
GGTGCAAGCC CACCATTCGA ATAATCAATT CTTATTAATT CCAACATTTA TTCATTATGC 3 2 844 0 
ATTTAACAGA ACTTAAAAAT ACGCCTGTTT CTGATCTTGT GAAACTCGGC GAGGAGCAAA 32 8 500 
TGGGCTTAGA AAATTTAGCT CGTTTGCGTA AACAAGATAT TGTCTTTGCA ATCTTAAAAC 3 28560 
AGCACGCCAA AAGTGGCGAA GATATTTTCG GCGGTGGTGT GTTAGAAATT TTACCAGATG 32 8620 
GTTTTGGTTT CCTTCGTTCT GCCGACAGTT CCTATCTTGC AGGCCCTGAT GATATTTATG 323680 
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G CAAACGCGT AATAAAACCA GATAGGTAGA ACCACAATGC GATCCCTAAG ATACCGCCTA 330420 
ATAAAAACAT TGTGAAATAA ATCATTGAAT AACAGAACCC ATATTAAAGA TAGGTAAATA 330480 
CATTCCCATC ATAATAATTC CAATCAGACT GCCGATGATC ACCATCATTA ATGGTTCTAG 330540 
CATTTGTGAG AGTAAGTCAA TTTGATGATT AAGTTTTTCT TGGTAATTTT CAGCAATATG 330600 
CTCTAGCATC AAAGCGAGTT TTCCGCTTTG CTCGCCAATT TGTAGCATTT GTTGTGCTTC 330660 
CATCGGGAAA AGATCGCTAC TTACACTATT AGAAAACGCA TAGCCTTGAG AAACCCATTG 330720 
CAAAATTGAC CGCACTTCTT TATCTAATAC GATGTCATTT ACAAGCGTTT TCTTGGTTTG 330780 
CCACGTTTGT GTGCGAGGGA GAAAACTGTC TAGTGCCTGA TTAAGCGGTA CGCCAGCTTG 330840 
CAACATAATT TGTAAACTTT GACTAAAGTT CACTAAACGT GAAAGCTTTT GAATTGTGCC 330900 
AAAAATAGGC G TG ATAGAAA TAAGTTGATT TTyCTTTTGA TAAAACCAAG TCTGGCGTTT 330960 
TAGATAGAAA TAATAAAATA GAAAAAAACT CAAAACGAAA AATAGCAAAA TGCCAATATT 331020 
TTGCTTAAGG AAATTAGATA TTGAGAGCAA TATTGCGGTT ATTGTTGGTA ACTCCGCATT 331080 
GTTGCCACTG TACATTTCAG CAAATTGAGG CACGATAAAA AGCAGTAATG CGAGTGTCAA 331140 
TAATAGAGAA ATTCCCAATA CCATTGAGGG ATATAACATA ATTTt CTGTA ATTTACG TTG 331200 
TAATGCTAAA GATTGGCTGC G ATGTGTGG C TATTTTTTTA CAAACTACGG CAAGTTTTCC 331260 
CGTCATTTCT CCCACTTGAA TCAGTTGAAT TTCTTGTTGT GTGAGATATT TCCCTTGTTG 331320 
TTCAATGGCT TGTGAGAATG CTAAACCAGA TTCAATAGAT TGAAGTAGTC GTTCAAGCCA 331380 
TTCATTGAGT ACAATTTGAG TACAATTTTG TTGCAAAATT TGCAGACTGT TTTTTAACGG 3 3144 0 
AATAGCGGAT TGTAGCAACG TGGCTAATTG ATTGAGTAAT TCGCTAATTT CTGAATTTTT 3 31500 
GGG CTTTGCC nAAATTGCCA GTTTTGTTGT AATTTGATGT GAGTAAGCCC GCGGCTTATT 331560 
AATTGAAAGT GTGCTTGTTG TTTTGTATCC GCAATAATTG AGCCTTTTTG TTTCTGATTT 331620 
AATGGGTTAC TAGCTTGATA ATAAAAGAGT TTTTTAGTCA TAAATTTTTA CCTAACACAC 331680 
GTTCAATTTC TTTCTC ATC C GTTATGCCTT GGCTAACTTT TTCTAAACCA CTCGCACGTA 331740 
AATTTTTAAA ATCCGTTTGA TAGTCATTCT GCTGCCAATG TAGAAATTGA TACACGCCGA 331800 
TTCGACCTCG ATAACCTTGA TGGCAATCAC AAGAATTTG C T AAATTTC C A CCGCACTTTG 3 31860 
AACAAAGCTT TCGTACAAGA CGCTGTGCAA TAACGAGCAA TAAACTGTTT TTAATTTCAT 3 31920 
ACTGTTGAAT ACCGAGTTGT TGTAAGCGAG AAATAGCAGA AATTGCATCA TTGGTATGTA 331980 
AAGTTGAAAG CACCAAATGC CCAGTTTGTG CCGCACGTAA TGCAATCATT GCACTTTCTT 3 3204 0 
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CATCGCGAAT TTCGCCTAGC ATAATAATAT CGGGATCTTG ACGTAAAAAA GTGCGTAATA 3321O0 
GGCGGTTAAA ATCTAATCCA ATCTGTGGAT TAATTTGGCT TTGAATAATG CCATCAAGCT 332160 
CAATTTCAAT GGGATCTTCA GCAGTCATAA TATGTTTATC AGGCGTATTT AGCCATTGAA 3 3 2220 
GTGCGGTGTA AAGTGAGATA CTTTTTCCAC TTCCTGTTGG GCCAGTGACT AAAATTAATC 33 2280 
CTTGTGGCTG GCTAAGCACA CGTTGAAATG CTTGTTGCTG ATTTTCTGTC ATACCTAATT 3 32340 
CAGAAAAGCT AAGTTCTACT GGTTTATTTT GTTGCGCTCG CAACACGATT TTCTCGCCCC 332400 
AATGGGTTGG TAAGGTTGAA AGGCGAAAAT CAAGAATATC GGAAAAAGTC GTTTTAAATT 3 32460 
GAAATCGCCC ATCTTGTGGA AGTCGATTTT CACTAATATC TAATTTAGCC AGTAATTTTA 3 32520 
AGCGTGAAAT AATACGATTG GCGAATATTT TGCTTATTAA GGGTTGTGGT TGTAAAACTC 332580 
CATCAATTCT AAACCGCACT TGAAATTGAT CAGCCAAGGT TTCTAAATGA ATATCAGAGG 332640 
CATTTTTTTG TAAGGCAGAT TCAAAAATCT GATTAAGTAA GCGGATAACA GGTTCATCAT 33 2700 
CTTCTTGTTC AAAATGGGTT TCTTGATGTT GATAGAATTT AACTTGTTCT TCCACTTGCA 332760 
TTTGGTTCGG AGTAAGTTGT TGTAACAGTT CTTTGAGTTG GCTGCTTTCT AACAAAATTG 33^820 
GTTCGACAGG TTTTCCCGTT ATAAACGCAA TGGTTTCACA GGCTGAAAGA TTGGAGAGAG 332880 
AATCAACCCC TAGCCAGAGA CGATTATTTT CTTCTTTAAG TGGCAAAGCA AAATAACGTA 3 32940 
AGAGCAAGGA TTGTTGCTGC TGATTGCGTT CCCATAAATC TGGCGAGATC GTAAATACTT 333000 
CGCCATTTTT AGCGGTTACA CGCTGAGTAT GAAGTAAAGC ATAGCTCGTC ATTTTGTGAC 3 33060 
ACTTCCGCAA AAATTTGCTG GAAATAAAGA GGCATCCGTT CCTTTGCAGG TTGTTGTCCA 3 3312 0 
AGTTACACCT GCTGCAGCAT TACCTTTAGC TTGCAAAATA TATTCCATAT TTGCCAATGT 33318 0 
GCCATTCCCT TTTACTGTAA TACCACCTGA TTGAGTGATA ACTGAGGCTA CAT AG CCTTT 3 33240 
TGCTGTCTTT ATATCCGCTG CAATACCATT TTTTCCCCCT GTACAGCTTG TTGTTTCATT 3 33300 
TGTGCTATAT ACACATAATT CCACATCAGC CTTATAAGGC GCAGACGCTT GCAGTAATTC 3 333 6 0 
GGATACCGCA GCTTTTTTGG TATAATTTTG ATAAGAAGGA ATTGCGATGG TGGCTAAAAT 3 3342 0 
AGCAATAATT GCAATCACAA TCATTAGCTC GATTAATGTA AACCCTTTTT TCAAGGTTTG 3 3 3480 
CAGTGTTGTT AATTTCATTC ATTTTCCTTT TATTAAGCCT TTGTTGGCAA CGACATTGTG 3 33 54 0 
AAAAGGAAAA ATTTGTTTGT AAAATTGACC GCACTTTTTC TGCGATCCTG ATCGCAAAAA 3 3 3600 
AAAGGAAAAA TGTATGCGAA AAATAAAAGA TATTGAGAAA GGATTGCTGA CCGATTGTCG 3 33660 
GCAAATTCAG TCACCTCACT TTGATAAACG CCCAAATCCG CAGGATATTT CCTTATTAGT 3 33 72 0 
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GAAAAATGCA 


ATGATTTTGC 


333960 


CATTGGAATT 


GAGTTGGAGG 


GGAGTAATGA 


ACAG CCTTTT 


ACCGACGCTC 


AATATTTTTC 


334020 


ATTGCAAGaA 


CTTAC CAATG 


TAATAATGAA 


AAGTTACCCG 


AAGATTACTA 


AAGATCGAAT 


334080 


TGTTGGACAT 


TGCGATATTT 


CGCCGAAACG 


TAAGATCGAT 


CCTGGTCAAT 


ATTTTGATTG 


334140 


GGAACGTTAT 


TTATC CAGTG 


TGAAATAATC 


TGTATAAAAA 


TTAAACATCA 


AATTTTGGGT 


334200 


ATTCTGTGTA 


TTCTAAATGC 


GtTCATTCTT 


TTACTTCTAA 


CCAGTTGAAT 


TAAAAGGAAT 


334260 


AAATTTAAAG 


CTGAAATTTT 


ATTAAAAAAG 


TTAATTTGAG 


TTATACAGTA 


TTATAGATTT 


334320 


TCTCAACCAC 


AATTTTATCC 


ACAGAAATAT 


TGAATAACTC 


AAAGAGATGG 


TAAAAAGATC 


334380 


TTTTGTGAAT 


AAAACCATTT 


TAAAAATTGT 


ATTTTTTTAC 


CGCACTTTAT 


TGCATTTTTG 


334440 


TATGTAAAAA 


GTGCGGTTTA 


TTTTTATTTA 


AGATCTTTTG 


GTTACAAATA 


ACCCGATTTA 


334500 


TTTTGAAGTT 


TGATGTGATA 


AGGATCTTTG 


CCCTCTTTTT 


ACGAGATCAA 


TTCACAATCA 


334560 


AAAATCCATT 


ATTCACAAGT 


TATCCACnAC 


ATAGATACTT 


TTCGTCATAT 


AAACAAAATA 


334620 


AGAATATTTT 


GTTGTCTAGA 


ATTTTCCCCT 


CTCCGTAAAA 


ATTGTG TATT 


TTTAACCGCA 


334680 


CTTTAGGTTT 


AAAC C CAATG 


TGCGGTTATT 


TTTTATTGAT 


TTTCCTCTTT 


TCGCGTTTTG 


334740 


CGATTTTTTC 


TAAAATATGA 


TAATTTTTCG 


CCGTCAATAA 


GACATCAAAC 


ACAAATCAAT 


334800 


ATGAACGACG 


AACAGCAAAA 


TTCAAACCAA 


TCTGAAAATA 


CAAAAAAACC 


TTTTTTCCAA 


334860 


TCTTTATTTG 


GTCGCTTTTT 


TCAAGGCGAA 


TTAAAAAATC 


GTGAAGAACT 


TGTGGAAGTG 


334920 


ATTCGCGATT 


CAGAACAAAA 


TGATTTAATT 


GATCAAAATA 


CGCGCGAAAT 


GATTGAAGGC 


334980 


GTGATGGAAA 


TTGCGGAACT 


TCGTGTTCGC 


GATATTATGA 


TTCCTCGTTC 


ACAAATTATT 


335040 


TTTATTGAAG 


ATCAACAAGA 


TTTAAATACT 


TGTTTAAACA 


CAATTATTGA 


ATCCGCTCAT 


335100 


TCTCGTTTTC 


CTGTGATTGC 


CGATGCGGAT 


GATCGCGATA 


ATATAGTGGG 


C ATTTTG C AC 


335160 


G CAAAAG ATT 


TATTAAAATT 


TTTACGTGAA 


GATGCGGAAG 


TCTTTGATTT 


ATCTTCATTA 


335220 


TTGCGCCCCG 


TCGTGATTGT 


GCCAGAAAGC 


AAAAGAGTGG 


ATCGTATGTT 


GAAAGATTTC 


335280 


CGCTCAGAGC 


GTTTTCATAT 


GGCCATTGTG 


GTGGATGAAT 


TTGGTGCGGT 


ATCAGGTCTT 


335340 


GTTAC CATTG 


AAGATATTTT 


GGAACAAATT 


GTTGGCGATA 


TTGAAGATGA 


ATTTGATGAA 


335400 
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GAAGAGATTG CGGATATTCG TCAGCTTTCT CGTCATACTT ATG CTGTGCG TGCGCTCACA 335460 
GATATTGATG ATTTCAATGC GCAATTTAAT ACGGATTTTG ATGATGAAGA AGTGGATACC 335520 
ATTGGAGGTC TGATTATGCA GACTTTTGGT TATTTACCAA AACGTGGGGA AGAAATTATT 335580 
TTGAAAAATC TTCAATTTAA AGTAACTTCT GCAGATAGTC GTCGATTAAT TCAACTTCGA 33 5640 
GTCACAGTGC CAGATGAACA TTTGGCAGAA ATGAATAATG TAGATGAAAA ATCTGAATAG 335700 
GATTTTATTA TCAATTAAAT TTATGAATAA ATATTTTACT TATCTTATTG CGATTATATC 335760 
AGGTTTACTC GGCGTTTTTG CCTTTTCGCC ATTTGATTAT TGGCCTTTAG CCTATGTTTC 335820 
TTTACTCGGT TTACTTTATG TCGCTAAAAA TCCTAAAAAA TCCACCGCAC TTTTATCCAC 335880 
TTTCTTGTGG GCGATGGGAT TTTTCTG CTT TGGAGTAAGT TGGCTTAATG TCAGCATCCA 335940 
TCAATTTGGT GGTGCATCGT TAGGGGTAAG TTATTTCCTT GTCGGTTTAC TTGCGGCTTA 336000 
. TCTTGCACTA TACCCGATGC TATTTACCTA TTTAGTTCAT CATTTCAAAG TGCAAAGTGC 336060 
GGTAATTTTT G CCGTGATTT GGACATTGAC AGAATTTTTA CGAGGTTGGA TTTTTACCGG 336 12 0 
TTTTCCTTGG CTTCAATTTG GTTATACGCA AATTGACAGC CCATTTTATG GCATTGCCCC 336180 
CATTTTTGGC GTAACAGGAT TAACGTTCTT TACTGTTTGG GCAAGTGCGG TCATTTTTAA 336240 
TCTCGTTTCC TCTTTGTTTA AAACGAAAAA CCTTAAATTA GTTTTAGCGA ATGCTTTGTT 336300 
ATTGATCATC GTGGGGGGAT TAAGTGCTTA TTCATCCCGA ATTCACTTTG TAAAATCTGT 336360 
TGAAGATAAG GCAATTTCAG TCACGCTTGC ACAAGGTAAT ATTGAACAAA ATCTTAAATG 336420 
GGATCCTAAT TATTTCTATT CTACTTTGGC GATTTATCAA AAATTAATCA CAGAAAATCT 336480 
TGGTAAAACC GATTTAATTA TTTTGCCTGA ATCAGCCTTG CCAACGCTTG AAAATGCGAT 336540 
TACGC CATTT TTTGAGGGAT TAGAGCGTGC AGCTAAAGAA ACGAAAACGG AAATTATGGT 336600 
TGGAACTGTA TTCCAAGATA CAAAATCTGG TAAGTTATTA AATTCAATTA TGACTG CTGG 336660 
AAACCCAGAT TTTCCTTATC AACCAAATAC GCAAAATCGT TATAATAAAC ATCATCTTGT 336720 
GCCTTTCGGG GAATATGTTC CGCTAGAAAG TATTTTGCGA CCACTTAATT CAGTGTTTAA 3 36 78 0 
TTTACCAATG TCAGCATTTC AAAGTGGGGA GGCAGTTCAA CCCTCTTTAA TCGCAAAAAA 336840 
ACGTGCTTTT TCGCCAGCAA TTTGTTATGA AATTATTTTC GGTGAGCAAG TGCGACAAAA 336900 
TTTGAAACAA GATACGGATT ATTTGCTCAC GTTATCTAAT GATGCGTGGT TTGGCGATTC 3 36 96 0 
TATCGGGCCT TGGCAACATT TACAGATGGC AAGAATGCGT G CTTTAG AG T TAGGCAAACC 3 3702 0 
GCTTATTCGT GCAACAAATA CGGGGATTTC AGTTTTTGTT GATGCGCAAG GTAAGGTATT 337080 
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AGCACAAGCC CCGCAATTTA TTGAAACAAC TTTGACTTAT AAAATTG C AC CTGCGGAAGG 3 3 714 0 

CAAAACCCCT TATTCTGTGT TAGGAAATAT GCCTTTGTAT GCTTTATCTT TATTGTTTTT 3 3 7200 

ACTCTTGCAC AGTATGATGG CGTTTATTCG TCGTAAGATG AATATCTTAT AAAAAC AC C A 3 3 7260 

GTTAAAATGA CCGCACTTTT TCATTCATAT AAATAAACAT TATGCGCATA CCAAGAATTT 3 3 7320 

ATCATCCAAT ATCCCTTGAA AATCAAACTC AATGTTATCT TTCAGAAGAC GCGGCAAATC 3 3 73 80 

ACGTTGCTCG AGTGTTAAGA ATGACAGAAG GCGAACAGCT TGAACTCTTT GATGGTTCTA 3 3 7440 

ATCATATTTA TCCAGCAAAA ATTATTGAGT CAAATAAAAA GTCCGTAAAA GTAGAGATTT 3 3 7500 

TGGGGCGTGA GCTTGCGGAT AAAGAATCCC ATTTAAAAAT CCATTTAGGA CAAGTGATTT 3 3756 0 

CTCGCGGAGA ACGAATGGAA TTTACTATTC AAAAATCCGT AGAATTAGGG GTAAATGTGA 3 3 7620 

TTACGCCATT ATGGTCAGAA CGCTGTGGCG TAAAACTTGA CGCAGAGCGG ATGGATAAAA 3 3 7680 

AAATCCAACA ATGGCAAAAA ATTGCGATTG CAGCTTGTGA GCAATGCGGT CGTAATATTG 33774 0 

TGCCAGAGAT TCGTCCACTG ATGAAATTAC AAGATTGGTG TG C AG AAAAT GATGGTGCAT 3 3 7800 

TGAAATTGAA TTTACATCCA CGTGCGCACT ATTCTATTAA AACCTTACCG ACTATTCCTG 3 3 7860 

CTGGGGGTGT TCGCTTATTA ATTGGCTCGG AAGGTGGATT ATCAGCACAA GAAATTGCCC 3 3 792 0 

AAACTGAACA GCAAGGATTT ACTGAAATTT TATTAGGAAA ACGTGTTTTA CGCACTGAAA 33798 0 

CTG CATC ACT TGCAGCGATA AGTGCATTGC AAATTTGTTT TGGGGATTTA GGCGAATGAT 3 38040 

GGAATTACAA GGAAAATTTT TGATCGCAAT GCCACATTTA GACGATTATT TTAATCGCAC 3 3 8100 

GGTGGTGTTT ATGTGTGAGC ATAATGAACA GGGTTCAATG GGTTTGGTAA TTAATC AG C C 33816 0 

GACAGATTTA AGCATCGCGG AACTTTATTC AAAACTCAAT TTTATGATGA AAAATG AC CG 33 8220 

CACTTTTGGT AATGAAATGG TGGTTGCGGG TGGGCCTATG CATACAGAAA GAGGGTTTAT 3 3 8280 

TTTACATAAG AATACGCTTA ATGCTTTTCA ACATACTTAC AAAGTCACAA AGGAACTTTC 3 3 3340 

AATGACAACC TCCGCTGATG TGGTGGAAAC GCTAGGTTCA ACTTTTGCAC CTGAAAAATA 3 384 00 

TCTTGTTGCG CTAGGATGTT CAAGCTGGGG GGCTGGTCAG TTAGAAAAAG AAATCAGAGA 3 3 8460 

TAATGCGTGG TTGGTGGTGT CATCTAACGA T C AAATTTT A TTTGATATGC CCTATGAAGA 338520 

TCGTTATGCT GCAGCCAATC AGTTACTTGG TATTCATCCT TATAATTTTG CGTTAGCGCA 3 38 580 

AGTGGGGCAT AGTTAATGGG AATTACCGCT CTGGCTTTTG ATTTTGGCAC TAAAAGTATC 3 3 864 0 

GGTTGTG CCA TAGGACAAAG TATTACTGGC AC AG C AC AAG CCTTGCCCGC TTTTAAGGCG 3 3 8700 

CAAGATGGCA TCCCAAATTG GGAAGCGATC GAAAAATGTT TGAAAGAATG GAAACCAGAT 3 3 876 0 
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GTTGTCATTG TTGGTCTTCC TCTAAATATG GACGGAACAG AACAAGATCT CACTCTACGT 338820 
GCTCGTAAAT TTGCAAATCG TTTACAAGGT CGTTTTGGCG TTAATGTTCA TTTACAAGAT 338880 
GAACGATTAA CCACCACGCA AGCTCGTAGT GAGATTTTTG AACGTGGTGG TTTTAAAGCA 338940 
TTAAAAAAAG GCAAAATCGA TGGAGTGTCA GCTTGTTTAA TTTTAGAGAG CTGGTTTGAA 339000 
TACGCTGAAT ATTAGATTCA CATTGTAAAA ATAAACTTCA TTTTAGAATG ATTAAAGCGC 339060 
GATTTTATAA TCTCGCTTTT TCGCCACCAA ATTCATCTAA TAAATTTTCG GTGAGTTTGG 339120 
CTAAAATACC AGTCATCAGT ACAAAATCTG CATCGAAACG CTGTGCAAAA TCTTCTTTCA 339180 
GAATATCATC ATTTTTTTCC CGTACAGTAT CTGCAAATTT TAAACGTTTG ATTGTGCAAT 339240 
CTTCGTTAAA AACGAAAGTT AAGGTATCTT CCCATTCTAA GGCTAATTTA CTTACTACTT 339300 
TTTTGCCATC TTGCAGAAGA GCGAGAATTT CTTCATTTTC TAAGGGTTGT TTTTTACAAC 339360 
GAATAACGCT GTCTTCCTGA CTGCCACGAA GCTCGGCTTC TTCTAATGCA AGTAACCAAT 339420 
G CGGCAAATT ATCTTGAAGG ATCCAATTGG TTAAAATCGT G CTGGGTTCA TTAGCAAAGG 339480 
CTAAAGGCAC GACAGGGAGC GAACCTAAGG ATTTACGCAG TAATGCGAGC GCATCTTCCG 339540 
CGCGTTTACT TGATGCGGCA TCAATATGAA TAAGATTATT TTCCGTATCA ATCCATAGTG 339600 
CCGTATGCTG ATTTTTACTA AATGCACGAG GTAATAAATT CATCACGACA TCATCTTTTA 33 9660 
ATGTTTGTTT TTCTACCTTT TTAAGTTTGC GATTTTCTTT TTGTTCAAGG CTTTCAATGC 339720 
GGTCATCTAA TTCACGTTTG ACCACATTTG CCGGCAAGAT TTTTTCTTCT TTTTTGG CGA 339780 
TTAATAGAAT TTGTTTTCCA ACAGAAAAAT AGAGTAAATC ACTTCCTCGT AATGGAGCAG 3 3 984 0 
ACCAGCCAAA TTTACTTTGA TCTTGAGTGC CGCAAGGATG GAATTGACAA TCTTCCAGTT 339900 
GAGTTTGTAG CTGTGCAAGA TCCCAATCTA GCGGTTTAGT CAGTCTATAA GTCATTAAAT 339960 
TTTTGAACCA CATCGTAACA CTTCCTTCAA TATAGGGTAG AATGTGCGGT ATTATAGCCC 340020 
ATAAAAGTAG GAAAAACGAT GCAACACAAA TTGATTGCCT TTATTGGTGG CGGCAATATG 340080 
GCACAAGCTA TTATTCTTGG ATTATTGAAA CAAGGTTATC CTGCCGAGCA AATTATTGTG 340140 
AATGATCCAA ATGAAGAAAA AAGAGCATTT TTTGCTAATC TTGACGTGGC AACTTCAGAA 340200 
AATAATGTGG GAAGTGCAAT CAAAGCTGAG GTTGTGTTGC TTGCGGTAAA ACCTCAAATG 340260 
ATGGCTGAAG TTTGTTCGCC ACTAAGTGCG GTGGATTTTT CCGACAAATT GTTAATTTCC 340320 
ATTGCTGCGG GTATTTCAAC TGAACGCTTG AACGCATTAA TTCCAAGCGT AAAATCGATA 340380 
GTACGCGTCA TGCCAAATAC GCCCGCATTG G TGGGGG AAG GTATGGCGGG TTTATTTGCA 340440 
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«w v» \jivinnn X n 


v_vj>-t\j i VjAAAA 


x TACCGCACT 


TTTGCACAAG 


ATTTATTGGG 


AGCAGTGGGA 


340500 


L\ CZ 7x 71 f*f^fZT*T*T' 


wVjvj X uHA X vj>\ 


TP A A A PTPA A 

I GAAACTCAA 


ATGCACGCAG 


TCACTGCCGC 


GTCAGGAAGT 


340560 


rtV?d*w 1 u^L 1 


ATTTCTTTTT 


AATG CTGGAA 


GCAATG CAGA 


AAGCCTTAAT 


CAAAATGAAT 


340620 


H , T**TV!I 71 TP. A a ft 


AAALAUCu CG 


TGAATTAGTC 


CAGCAATCTA 


TGCTTGGTGC 


AGCAAAAATG 


340680 


u X <jAv~ AVa AAA 


a *r*r*r*f*f* TV TV TV T* 

ATC CG CAAAT 


TGCACTTTCA 


ACTTTGCGTG 


AAAATGTTAC 


CTCGAAAGGT 


340740 


ppp a rfh p*pp 

t> UAL TACTG 


CGGCAGCGTT 


AG CAGTATTT 


GACGCACAAC 


ATTTTAATCA 


AACAATAGAG 


340800 


p A T\ p PT A TPP 


AGGCTTGTTT 


GTCTCGTTCA 


CAAGAAATGG 


AAACTTTATT 


CTAATGCAAG 


340860 


TPPCTPPT' 1 M 1 1 
iv.v.vjlLL'l X 1 


I lAx ItsGC I7L 


GCACTGAGCT 


TTCTAGGTTA 


CTACTGCGCT 


TATGGCGTGT 


340920 


XXX 1 VJU^VJ X X 


V_ X X LLLLuLA 


^*P^ , *T»^H/"* TV TV TV *T* 

I GGTTGAAAT 


CG CAAC AAT A 


TGGCGAAGAA 


ATGATTGGTT 


340980 


X X Vjni 


rp TV PT'P PTT A T 

1 Ao 1 VjC I TAT 


ATTTTCCGAT 


TTGCGGGCGG 


ATTATTTTTT 


TCATCG CTGA 


341040 


1 X M/WuiAAut- 


P A A TP A r P A T'M 

tAAT CAT ATT 


ATTATTGCCT 


TACGCCTATT 


GGCGTTAGCA 


TCGGCAATTA 


341100 


x xAIijtjCurGC 


TATGAGTTTA 


GTTGCGCATA 


ATTTTTGGCT 


G TT ATTT ATT 


GCTATCGGTT 


341160 


iulA TvaCTTC 


TGTG AATT CA 


GCAGGAATGC 


CGATTGGGGA 


TTCGTTGGCA 


TCCACTTGGC 


341220 




C GG CTTGG AT 


TACGGCAAAG 


TACGTTTAAT 


TGGCTCTGCT 


GCTTTTATCC 


341280 


T"T'ppppt apt 1 


GGTGTTCGGC 


GGCATGATTG 


GCTGGGTTGG 


CGAG CAAAAT 


ATCGTGTGGA 


341340 


I i nAACCGC 


ACTTTTAT CT 


TTCTATACCA 


TCATTCAGCT 


TTTAAAACCC 


ACCATTCCCC 


341400 


CAAAAGATGA 


AATACCTGAA 


GATAG CGTTC 


AAAATGATGT 


TGGTTTTATT 


GCTTTACTGA 


341460 


Ti k it it fT> f*^ % m 

AAAAT C C GAT 


GACATTGCGA 


GTTATGATAG 


CAGTAGGG CT 


GATTCAAGGC 


TCTCATGCGG 


341520 


til ATTATGT 


TTATAGCACG 


ATTTATTGGA 


CAAGTATTGG 


GATCTCCGTT 


TCTCAAACTA 


341580 


VjC x I AC x I TCj 


GGGCATTGGC 


GTATTAG CGG 


AAATTGTATT 


ATTTTTCTTC 


TCTCGTCGCT 


341640 


T7ATTPPA A TV 7\ 

xAl ILLAAAA 


T ATTT C TAT C 


AGTGTGCTGC 


TTTATATTTC 


TGCGCTTGCG 


TGCGTTGGAC 


341700 




TV TV rr* TV TV FT* 

AATAGGATAT 


ATTGAAGATT 


TCTGGTTGAT 


ATTCTTATTG 


CAACTTATGC 


341760 


TV TV ^* 1 1 if (iff* m •« « 

ACAGTTTAAC 


CTATGCCGTT 


TGCCATTATG 


CGATTGTTCG 


CTATATCACC 


ACGCAACCGC 


341820 


Tv 7v TV p P/*^ Tv rn t\ 
AAAGCCATAT 


TGCGAAGTTA 


CAAGGTTTGT 


ATAATGGTTT 


ATCCAATGGT 


GTGCTCATTG 


341880 


CTATTTTCAC 


TGCGATTGCT 


GGCATGATTT 


ATCCAACGTC 


GCCAGCGATG 


ACATTTGTAT 


341940 


TTATGAGTAT 


CATTG CAGCT 


GCTGCATTTT 


TTGTGATTCC 


ACGTAAGTTA 


AATGCCTTTT 


342000 


TGATTGTTCA 


GCATAAATAG 


GACGCATTGC 


TTTCCTTGTA 


GAAAAGTAAT 


ATGAAAAATC 


342060 


TTGCGCTCAT 


TGATTTGTTT 


CTTAACGAAT 


ATTGGATTGA 


AAAAGGGCTT 


TCAGAAAATA 


342120 
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CCGTTCAATC 


TTATCGTTTG 


GATTTAACCG 


CACTTTGTGA 


TTGGTTGGAT 


AAAAACGACT 


342180 


TGTCTTTGGA 


AACTTTAGAT 


GCTGTTGATT 


TGCAAGGTTT 


TCTAGGAGAG 


CGTTTGGAGA 


342240 


AAGGTTACAA 


AGCCACGAGC 


ACTGCGCGAA 


TGTTAAGTGC 


AATGCGTAAA 


CTTTTCCAAT 


3423 00 


ATTTATATCG 


GGAAAAATAT 


CGAGTGGATG 


ATCCAAGTGC 


GGTG CTAAGT 


TCGCCTAAAT 


342360 


TACCCAGTCG 


CTTGCCAAAA 


TATTTAACCG 


AGCAGCAAGT 


GTCAGATTTG 


CTCAATACGC 


34242 0 


CAAATGTTGA 


AGTGCCTTTG 


GAGTTGCGTG 


ATAAAGCAAT 


GTTAGAGTTG 


CTTTATGCAA 


3424 8 0 


CGGGGCTTCG 


TGTAACGGAG 


TTAGTTTCGC 


TTACGATTGA 


AAACATGAGC 


GTGCAACAAG 


3 42 54 0 


GCGTGGTGCG 


AGTGATT GGT 


AAAGGGAACA 


AGGAACGTAT 


TGTGCCAATG 


GGGGAAGAAG 


34 26 0 n 

-J T A> W \J \J 


CTGCCTATTG 


GGTTCGTCAA 


TTTATGCTTT 


ATGGTCGCCC 


TGTGCTATTA 


AATGGGCAGA 


3 42 6 6 0 


GTTCTGATGT 


TGTGTTTCCA 


AGTCAGCGCG 


C AC AG C AAAT 


GACTCGTCAA 


ACTTTCTGGC 


342 72 0 


ATCGTGTAAA 


ACATTATGCC 


ATTCTCGCGG 


ATATTGATGC 


TGATGCACTT 


TCTCCTCACG 


342780 


TTCTTCGCCA 


TGCTTTTGCC 


ACCCATTTAG 


TAAATCACGG 


TGCGGATTTA 


CGCGTGGTGC 


342840 


AAATGTTGTT 


GGGTCATACA 


GATTTATCTA 


CAACACAGAT 


TTATACTCAC 


GTGGCAAAAG 


342900 


AAAGATTGAA 


ACGATTGCAT 


GAAAGATTTC 


AT CCTAGAGG 


ATAATTTCGG 


GGTAAATTT'T 


342960 


TATAGTTTTA 


AAGATTGATT 


GTGGATTTCT 


TATTGATTAA 


TTAATTTGAC 


ACGCATCACA 


34 302 0 


AATTTTGAAT 


AAAAAATTAG 


ACAGCTCAAG 


CATATTTTGC 


TATCTTGCGA 


AAAATTTCGA 


34 3 080 


CATTTGTTCT 


ATCGAAAAGT 


AATTTTGGAT 


TCTAAAATGA 


AAAAAATCAC 


GCTTTTTTTT 


3 4 314 0 


ACCGCACTTT 


TATGCTTATT 


TTCAACTTCT 


GTTTTAGCCG 


AATCTAATTC 


AATACCTAAA 


34 32 00 


CAATGTGAAC 


AACTATTTAA 


AGAAACAGAG 


AATTTGATTG 


CCCAAGCAGA 


AAAACAACCT 


3432 6 0 


GGAACCCACA 


TTCAAGTAAA 


TAAAATCAAA 


AGTAAACTAA 


ACCAAAGCAA 


ACAACAAATT 


7 A ~K ~K o n 

J *4 J J j£ \J 


CTTCAAATGG 


AATTAGCGAC 


TCAGTTAAAA 


AGTTGTGAAG 


TGGGTTTAGC 


AAAATTAAG T 


343 3 8 0 


AACTTGAAAA 


GTTATAGTGA 


GTAATTAAAA 


AAGCCTTATT 


GAGTACAATA 


AGGCTTTTTT 


34344 0 


AGTGGTTAAA 


TTTATCTAGT 


CTGATAATTT 


CTGCAAACCA 


AAATGACGAT 


AAGTTTGCGA 


34 3 500 


TGTTGCAATA 


CGGCCACGTG 


GCGTTCTTTG 


TAGAAACCCC 


TGTTGAATCA 


AATAAGGCTC 


343 560 




i LAAibb TAT 


UAU^TTCTTC 


GCCAATTGCA 


GCAGCAAGAT 


TATCTAGCCC 


343620 


C AC AGGGC C A 


CCATCAAAAC 


GCTCAATAAC 


TGCAGACAAC 


AACTTTCTAT 


CTAAATAATC 


343680 


AAATCCAGCA 


TCGTCTACAT 


CTAG CATTG A 


AAGTG CTTGT 


TTTGCCACAT 


TTATAGAAAT 


343740 


AATACCGCCA 


TTACGTACAT 


CTGCATAATC 


ACGCACACGT 


CGTAATAAAC 


GGTTTGCAAT 


343800 
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TCGAGGCGTG CCTCGTGAAC GACGAGCCAC CTCAAAAGCG GCTTGTTGTT CTAATTCAAG 343860 
ATTTAAACAG CCCGCACTTC TTGCAACGAT AGATGTTAAA TCTTCTACAG AATAAAACTC 343920 
CAAACGTTGC ACAATACCAA AACGATCACG CAATGGAGAA GTCAAAGAAC CCGCTCTAGT 343980 
GGTCGCGCCA ACCAAAGTAA AAGGGGGCAA ATCTAATTTA ATGGAACGAG CAGCGGGTCC 344040 
TTCGCCAATC ATAATATCCA ACTGATAGTC TTCCATTGCT GGATAAAGGA CTTCTTCAAT 344100 
CGCAGGGGAA AGTCGATGGA TTTCATCAAT AAATAACACA TCATGCGGTT CAAGATTGGT 344160 
TAGCATTGCA GCCAAATCGC CCGCTTTTTC TAACACAGGC CCTGATGTCG TACGAATATT 34 4220 
CACGCCCATT TCATTTGCCA CAATATTAGC AAGGGTTGTT TTTCCCAAGC CAGGAGGGCC 344280 
AAAAATCAAT AAATGATCTA AGGCATCTTG ACGTAGTTTG GCTGCCTTGA TGAAAATATC 344340 
CATCTSSTCG CGSACTTGCG GCTGCCCAAC ATAATCTGCT AAT AG TTTAG GACGAATAGs 344400 
ACGGTCAATA ACATCTTCAT CAACCTTAGC TTGCCCGCTA ATAATTCTAT CTGCyTCAAT 344460 
CATATTTTTC CTACAATGCG ACTTTTAATG CTTCACGAAT AACTTGTTCA CTGGTTAATT 344520 
CAGGTTTAGC AACGCGTTTT ACCATTTTTT CTGCTTCTAC AGGTTTATAA CCTAAAGCAA 344580 
TCAAGGCTGA AATTGCTTCA TCAGATGAAC TTTCCGAGTG GGATTCAATA GATGGGGATA 344640 
ATGGAATATG CGTACTTTCT ACAAAGAAAT CGCTTTGTTT TACGCCTTTA AATTTACCTT 344700 
TGAGCTCAAC TAACAAACGT TCAGCTGTTT TTTTGCCAAC ACCTGGAATT TTAGTAAGTT 344760 
TAGAAAGTTC TTCTCTCTCT ATTGCATAGG CAAATTGTTC GACCGACATG GCGGATAAAA 344820 
TCGCTAAGGC TAATTTAGGC CCCACCCCAT TTGTTTTAAT TAATTCACGA AATAAAGTGC 344880 
GGTCTGTTTT CTGGGCAAAT CCAAAGAGTA AGTGAGCATC TTCCCGAACA ACAAGATGGG 344940 
TGAATAAAGT TGTTTCTTGT CCAATTTCTG GTAAATCATA GAAACTCGTC ATCGGCAAAA 345000 
GCAATTCATA GCCTACGCCT TGCACATTGA GTAAAATTTC TGGAGGTTGT TTTTCTAAAA 345060 
GAATGCCTTG TAAGCGACCG ATCATAATTC TTCCGAATGA TAAAAAAATT AACGCATAGA 345120 
ATAAAATAAA ACTGGACGAA CATCCAGTTA AATTTTTAAT CTAAAGCGTC CTCGGCTATA 345180 
TCTGGTCTTT AAAAGTGCGG TCATTTTTTC TTGCGTTTCT GTCATTTTCA CAGAATTGGC 345240 
AATATGTAAA GAATGTTGAA TAGAATGCGC GTGTGTAATC GCAATAGCCA ACGCATCCGC 345300 
TGCATCGGCT TGAGGTTTAT CTGACAACTT CAAAATACGA GTCACCATTT CTTGCACTTG 345360 
CACTTTATCA GCGGAACCAA TGCCCACCAC TGTTTGTTTT ACTAAACGTG CGGCATATTC 345420 
AAAAACAGGT AAATCATTAT TTACCGCTGC AACAATCGCC GTGCCGCGAG CCTGCCCAAG 345480 
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TTTCAATGCT GAATCCGCAT TCTTCGCCAT AAACACTTGC TCAATCG CAA ACATATTAGG 345540 
TTGAAATTGC GTGATGATTT CAGTTACTCC AGCATAAATG CGTTTCAAAC GGGTGGGTAA 34S600 
ATCTTCAACT TGAGTACGAA TTGCGCCACT ACCGAGATAT TCTAAATGTT TTCCTGTTTG 345660 
ACGAATCACA CCGTAACCAG TTACGCGAGA GCCTGGGTCA ATACCTAAAA TAATGCTCAT 345720 
AATTTCTATA AAAAATGGGT AATTAAATTA CCCATTTATT GCATCATTAA AGTTGAGATG 345780 
CCACTTCGTC ACAGATTTCT CCGTTGTGAT ATACGTTTTG TACGTCATCA CAATCTTCCA 3458 40 
ACATATCAAT TAAACGAAGT AATTTTGG TG CAGTTTCAAT ATCAAGATCG ACGGTTGTTG 345900 
ATGGAATCAT CGTTACTTCA G CTTCTTGTA CTTTAAAACC AGCTGCTTCA ATGCCATCGC 345960 
GCACTGAACC TAAATCTTCC CAAGCAGTAT AGATTTCAAA TGAACCATCA TCTTGTGGTT 346020 
GAATATCATC AGCACCCGCT TCGATTGCCG CTTCAGTTAA GG CATC TTCA TCCGCTTCTG 346080 
CGATTAAGAT TAAACCTTTT TTGCTAAATA AATAACCCAC AGAACCTTCT GTTCCCAAGT 346X40 
TACCACCACA TTTAGTAAAA CTTGGGCGTA CTTGTGAGAT CGTACGGTTT GCATTATCAC 346200 
TTAAACATTC CACCATAACC GCTGTACCGC CTGGGC CAT A ACCTTCATAG ATTTTGGTTT 346260 
CCATATTGGT ATCATCGCCA CCGCCTACAC CACGATCAAT AGCGCGGTTG ATAGTATCGC 346320 
GCGTCATATT GTTGCTAAGT G CTTTATCTA CTGCTGCACG TAAACGTGGG TTAGCACTCA 346380 
CATCGCCACC ACCAATTTTA GCTGCGGTAA CAAGTTCACG AATTAATTTA GTAAAAATTT 346440 
TACCGCGTTG TGCATCTTGT GCTGCTTTGC GGTGTTTAAT ATTAGCCCAC TTACTATGAC 346500 
CTGCCATTTT ATTTCCTTCT GTTTTGATCT TATTGATCTT GATTTAATTA AACGTTTTGA 346560 
CCTAGGCTAG GTATAGCCTA GGTTAGTAAA AGTGTTACCC CGTTGGGAGT GGATGTAGCT 346620 
TTTCCCAAAA GAGAACCGCT ATCACCATAA TTAGCTGTGA TTATACTTAC CTACGGAGAT 346680 
TAAAAAGATA TTTTTTAATA GCTTCCGCAT TATTCTGGGA TTTCGTCATT TCTACTGCTT 346740 
TTTTCGCCGA CACCCAGsAA AAATCTAAAT GCTCACTCAA TACTGGAATA AATTCTTTTT 346800 
CCACCTCACA CAAAAACCAA TGTTCTTTGC AGTGAGTAAT ATTCGGTGCG T ATTT AT AG C 346860 
GGAAATGTGG AAAAATTTCA AATTCTATGC TCTCATTGCA ATCAAAAAGT GCGGTGGAAT 346920 
TTTCCGAAAT GTCTAACCGC ACTTCTTCCC ATAGCTCACG AATTGCTGTT TTTTTTGGTG 3 4 6980 
TCTCATCACT TTCAATAGTG CCAGTAACAG ACTGCCAAAA ATCAGGATCA TCTTGGCGTT 34 7040 
GGAGCATTAA AACTCTGTTT GTATCTTTAG TGTAAATTAC AACGAGAACA GATTGATTAT 347100 
TTTtGTATTG CATCATTAAA AATGCGGTCA AATCTGACCG CACTTCATTC CTAATATTTC 347X60 
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AAACTTATTC 


: TGCTTTAACG 


; ACAGAAATTG 


! CCAACTCTTC 




GGATTTGCAA 


347220 


AACTCGGGGC 


! TTCCGTCATT 


' AAGCACGCTG 


CTGCCGTGGT 


TTTPGGAAzir: 


' L? LAA 1 CACAT 


347280 


CACGAATGTT 


, TTCCGTGCCT 


GTTAAAAGCA 


TGGTTAAGCG 


GTPTAA2XPPG 


AAICjCTAAAC 


347340 


CAGCATGTGG 


TGGAGTACCA 


AATTTTAACG 


CATCTAATAA 


G A 21 21 PPP IV AT 


TTCTCTCGTT 


347400 


GTTGTTCTTC 


GTCAATACCA 


AGAATACGGA 


A C AC AG TTTG 


1 HjLAITTTC 


GGATCAAAAA 


347460 


TACGCACAGA 


ACCACCACCC 


ACTTCGTAGC 


CGTTGATGAC 


PZYTZlTPr"T , 7\ A 
^-M. iHl LG X AA 


GCATTGGCTA 


347520 


CCGCACTTGT 


TGGATCAGC C 


TCTAATTGCT 


CTGGGCTGAA 


rl X \_ X 111 LlVj 1 


G AAG TG AATG 


347580 


GATGGTGCAT 


TGCTGCAAGA 


TTACCTTCTT 


CATC ACG TT C 




^ ^ ^^^^ y± * ^^^^ » 

AAA T C AATG A 


347640 


CCCAAAGCGG 


TTGCCATTCG 


TCTAAACGAG 


TTAAGCCAAG 


ATP A PG 21 PPT 


AA1 X rL-AAAC 


34 7700 


GTAACGCACC 


CATTG CATC A 


GTGGTAGTTT 


GCCATTTGTP 


X bLALL AAAG 


AATAAAATAT 


347760 


CGCCAGTTTG 


TGCATTCACA 


CGTTCTGCTA 


ACCCTTTPPX* 


X ALA x V_ X TCA 


TTTAAGAATT 


347820 


TCGCAATCGG 


ACTTTGCACG 


CCTTCAAGAP 


PAGPZ1TTZ1 21 T 


AlL bTTTACT 


TTTG CCCAAG 


347880 


CCAAACCTTT 


CGCACCGTAG 


ATG CCTACAA 


ATTGTGTAT21 


A X LA I LAATT 


TGTTTACGAG 


347940 


TAATTTCTGC 


AC C ATTTGGT 


ACTCGAATGA 




GL LATTTGGA 


TTGTTTGCTG 


348000 


GCTCATTAAA 


TACTTTAAAT 


T CAAC AT CTT 




1 GL JTACATCT 


ACCATTTCTA 


348060 


ATGGATTACG 


CAAATCTGGC 


TTATCAGAAP 


c AAAA rf; 2i rr 


LAI xGLTTCT 


TG C C AAGTC A 


348120 


TCACTGGGAA 


TTTACCTAAG 


TCCACAC CAA 


TGGTATP A X1G 


LLA X AAGLLG 


TGCACCATGC 


348180 


GTTCCATAAT 


TTCGCGGACT 


TCTGGCGCAG 


TTAGG A 2121(7121 


AG 111 lCACA 


TCGATTTGAG 


348240 


TAAACTCAGG 


CTGACGATCT 


GCTCGTAAAT 


PTTPATPAPG 


oAAALA xTTT 


ACGATTTGAT 


348300 


AATAGCGGTC 


AAAAC CAGAC 


ATCATTAAAA 


GCTGTTTGAA 

s ^ V* X Srf XXX WATT 


AZ*GPTGTr , r , T 


GA nbCGGCA 


348360 


ATGCATAGAA 


TTTGCCTTTA 


TGCACACGGC 


TTGG CAPTA A 


A 1 A<*7 X L. oL oL 


/"** ^* ^< *^ ^ rr* r+ 

GCGCCTTCTG 


348420 


GCGTTGCTTT 


GGTTAGCATT 


GGGGTTTCAA 


TATCAAGAAA 


nL^.Al X Al L,A 


T L lATAAAG C 


348480 


GACGCACAAA 


GCTGGTGATT 


TTTGPAPGGG 


TTTTPZl aapr 

x x x X ^.nnhCu 


L. IUL-LjLCATT 


TCAGGACGAC 


348540 


GTAAATCTAA 


ATAACGGTAT 


TTTAAAPGTT 

A A l/Wi\,VJ ii X 


GTTPTTPGGT 


Lj 1 X A X X 1TGG 


TTGAAG T CT A 


34 8600 


ATGGTAAAAC 


GTCAGAAGCA 


TTGTAGATGC 


GTAATTCTTT 


CGCTAACACT 


TCCACTTCGC 


348660 


CTGTTGC CAT 


ATTTTTATTA 


ATTTGATTAT 


C AGG ACG AG C 


GATCACCTCG 


CCTTTAATTT 


348720 


GAATACAGAA 


TTCATTACGT 


AACCCAGCAG 


CCGCTGTCAA 


TGCATCTTGA 


TATTTAGGAT 


348780 


CAAAACAAAC 


TTGCACAATA 


C CAT C ACG AT 


CGCGCATATC 


AATAAAAATC 


AAGCCACCTA 


348840 
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AATCACGACG 


GCGATGAACC 


CAACCGCTTA 


ATGTTACGTC 


TTGTCCGATA 


TTGTTACGGT 


348900 


TTAATGCTCC 


GCAATAATGT 


GTACGCATCA 


TTTCAAAATC 


CTTTGTACTC 


TTTTAATTAT 


348960 


GCTCAAAAAT 


CTGTCGCAAT 


TATAGCGGAA 


AACCAAGAGA 


GATAAAAAGG 


TTATTTATGC 


349020 


TGACAAATCA 


ATCAAAAAAG 


CAAAGAAAAG 


TAATAAATAT 


TCAAAATAAA 


GGCTTTTTTG 


349080 


TGCCATATCA 


TCAGATTTTT 


TGTTATATTA 


TGCCGCTTGT 


TGAGATTAAA 


TTACGTTAAG 


34 9140 


GATAACCACA 


TGTTATTTAT 


CAATATTACT 


TTTGCCTGTA 


TTTTAGCGAT 


CCGTTTTTAC 


34 9200 


AGTCTGTCTA 


TTTCAATTCG 


TCATGAAAAA 


GCACTCATTG 


CAAAGGGTGC 


AATACAGTAT 


349260 


GGTAAACGCA 


ATTCCACACT 


GTTGTCCATT 


GCCCATGTTG 


CATTTTATTT 


TGCGGCGATC 


349320 


ATTGAGGCTA 


ATAAGCAAAA 


CCTGTCATTT 


AATAG CACTT 


CACAAATAGG 


GTTGGCGATT 


349380 


TTAATTTTTG 


CTATTGCAAT 


GCTATTTTAT 


GTGATTTATG 


AG CT AAAAG A 


GATTTGGACA 


349440 


GTAAAAATTT 


ATATTTTACC 


AGAACATCAA 


ATTAATCGAT 


CTTTCTTGTT 


TAAATACGTA 


349500 


CGTCACCCAA 


ATTATTTCTT 


AAACATTATT 


CCCGAATTAA 


TTGGTTTATC 


GCTTTTCTGT 


349560 


CAGGCTAAAT 


ATACCGCATT 


GGTTGGACTA 


CCTATTTATT 


TGCTTATCTT 


AGCCGTGCGT 


349620 


ATTAAACAAG 


AAGAGAGTGC 


CATGTCGCAT 


TTATTTCCTA 


AATCATAATT 


TATAAATTTC 


349680 


TTCTAAAAGA 


ACGACTCTTT 


CTCTCTGCCT 


TTAACACATT 


ATGGTAAAAG 


ATACTCTATT 


349740 


TTCTACCCCC 


ATTGCTAAAT 


TGGGGGATTT 


CATCTTTGAC 


GAAAACGTTG 


CTGAAGTCTT 


349800 


TCCAG ATATG 


ATTCAACGTT 


CCGTGCCGGG 


CTATTCTAAC 


ATTATTACTG 


CAATCGGTAT 


349&60 


GCTGGCTGAA 


CGTTTCGTCA 


CGGCTGATAG 


TAACGTTTAT 


GATCTAGGTT 


GCTCACGAGG 


349920 


AGCTGCCACA 


CTTTCTGCAC 


GTCGAAATAT 


TAATCAACCC 


AACGTnAAAA 


TTATTGGTAT 


34 998 0 


CGATAATTmC 


TCAACCGATG 


GTTGAACGTT 


GTCGCCAACA 


TATTGCGGCA 


TATCATAGTG 


350040 


AGATACCAGT 


AGAAATTCTC 


TGTAATGATA 


TTCGCCACGT 


TGAAATTAAA 


AATGCCTCAA 


350100 


TGGTCATTCT 


CAACTTCACC 


TTGCAATTTT 


TACCGCCTGA 


AGATCG CATC 


GCATTGCTTA 


3 50160 


CCAAAATCTA 


TGAAGGTTTA 


AATCCAAATG 


GCGTATTAGT 


ACTTTCTGAA 


AAATTCCGTT 


350220 


TTGAAGATAC 


TAAAATTAAT 


CATTTACTCA 


TTGACTTGCA 


CCATCAATTC 


AAACGTGCCA 


350280 


ATGGTTATAG 


CGAATTAGAA 




a a r*r*r* 7\ 
AAHjt„ACCGC 


AC TTG AAAAT 


GTGATGCGTA 


350340 


CAGATTCTAT 


CGAGACACAC 


AAAGTGCGGT 


TAAAAAACGT 


AGGATTTTCA 


CAAGTAGAAC 


350400 


TTTGGTTCCA 


ATG CTTTAAT 


TTTGGCTCGA 


TGATTGCGGT 


TAAATAAAAC 


AACGGTAATT 


350460 


TGATCnTCTT 


GCTTGCATAC 


AGCAATTGAA 


ATGATTAGTA 


TATACTTATT 


TAAATACATA 


350520 
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GTATATACGA GAGGGTAAAT ATGCTTACTA AAGTGTTTCA AAGTGGTAAC AGCCAAGCTG 350580 
TTCGGATCCC GATGGACTTT CGTTTTGACG TCGATACCGT AGAAATTTTC CGAAAGGAAA 350640 
ATGGGGATGT GGTATTACGC CCAGTTTCTA AAAAAACAGA TGATTTTcTT G CGTTATTTG 350700 
AAGGATTTGA TGAGACCTTT ATTCAAGCAC TTGAAGCGCG TGATGATTTA CCGCCTCAGG 350760 
AGCGAGAAAA TTTATGATTT ATATGTTAGA CACCAATATC ATTATTTATT TAATGAAAAA 3 50820 
TCGCCCCAAA ATTATTGCCG AACGAGTATC ACAATTATTG CCTAATGATC GCTTAGTTAT 3 5 0880 
GAGCTTTATT ACTTATG CTG AACTTATTAA AGGCGCCTTT GGTAGTCAAA ATTATGAGCA 350940 
ATCAATACGA GCAATAGAAT TACTTACTGA ACGAGTGAAT GTACTATATC CCAATGAACA 351000 
AATCTGTTTA CATTATGGCA AATGGGCAAA TACACTCAAA AAAC AAGGG C GACCTATCGG 3 51060 
AAATAATGAT CTATGGTTCG CTTGTCACGC ATTGAGTTTA AATGCCGTTC TTATTACACA 351120 
TAATGTAAAA G AATTTCAG C GAATTACAGA TCTTCAGTGG CAAGATTGGA CAAAATAGAT 351180 
AATTCTGATT TTTTATTTTA CTTATGACAT AATCTGTTTA ACTCAACTAA AAAAGGACAA 3 51240 
ACCAATGCAA ATTTTACACA CTATGTTACG CGTGGGCGAT TTAGATCGTT CAATCAAATT 351300 
TTATCAAGAT GTTTTAGGTA TGCnCTTATT ACGCACCAGT GAAAACCCAG AATACAAATA 3 51360 
TACGCTGGCT TTTTTAGGTT ACGAAGATGG CGAAAGTGCG GCAGAAATTG AATTAACATA 351420 
CAACTGGGGC GTAGATAAAT ATGAACATGG CACAGCATAT GGACATATCG CTATCGGCGT 351480 
GGATGATATT TATGCGACTT GTGAAGCCGT TCGCGCGAGC GGTGGTAACG TTACTCGCGA 351540 
AGCAGGGCCA GTTAAAGGTG GCTCAACCGT CATTGCTTTT GTTGAAGATC CAGATGGTTA 351600 
TAAAATTGAA TTTATCGAAA ACAAAAGTAC TAAATCTGGT TTAGG CAACT AAAGTGCGGT 351660 
CAATTTTTAT TGTATTTTTA ATAACGTAGG GATTCCTACG TTATTTTATT TTTAAGAGAT 3 51720 
AGATTATGTC CGATTCTCAA GAAATCCCTT ATCACAACCA ATTAAAAAAT CGCTTTCGTG 351780 
GCTATTTCCC CGTCATTATT GATGTTGAAA CTGCAGGTTT TGATGCAAAA AAAGATGCGT 3 51840 
TACTCGAACT AGCCGCCATC ACATTAAAAA TGGATGAAAA CGGTTATTTG CATCCTGATC 3 51900 
AGAAATGCCA T TTT CAT AT C AAGCCTTTTG AAGGCGCAAA TATTAATCCA GAATCCTTGA 3 51960 
AATTTAATGG TATTGATATT CACAATCCAC TACGTGGCGC GGTGTCTGAA CTTGATGCAA 3 52 02 0 
TCACGGGATT ATTTCAAATG GTGCGTCGAG GACAAAAAGA TGCGGATTGC CAACGTTCTA 3 52080 
TCATTGTGGC GCACAATGCC GCTTTTGACC AAAGTTTTGT GATGGCTGCC GCTGAACGCA 3 5214 0 
CTGGCGTAAA GCGTAATCCC TTCCACCCTT TTGGCATGTT TGATACCGCA AGTTTAGCGG 3 52200 
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GCTTAATGTT TGGTCAGACT GTTCTCGTTA AAGCCTGCCA AGCGGCAAAA ATTCCTTTTG 3S3 2 ,0 
ATGGTAA^CA AGCCCATTCA GCGTTATATG ATACCGAACG CACGGCGAAA TTATTCTGCT 35*330 
ATATGGTCAA TCATTTAAAA GATTTAGGTG GTTTCCCACA TATTGCGTCT GAmCTAGAAC 353300 
AGGAAAAAAC AACTGAAAAA GAGACCGCAC TTTAAGAAAA ATAAAAAACT GGTTGCAATC 35,440 
GTTTTAT^ TGCAGTAGTT TAAAAGCAAT TCTTATTTGG AGATAAAAAA TGAACATTAT 35*500 
CCGTCGTCAC CATCATCACC ATTCAACTGA ATAATCTrTC AGGTTTTTGG TGTGCTTGGA 35=5,0 
AGATTTCTTC CGAGCTGACA GGATAATAAA AATCTTACCC CTCGGAAGGC AACTTTCGAG 353,30 
GGGTTTTGTT TTATTTACGA TATAGGAGGC ATTAATTTAG CGTTATTCTT TATTCAATAA 353,00 
AAATTTTAAA GTAAAGTATT TATCATTTAA TTTACAAGGA AAAACACAAT GTTATCAAAC 353,40 
CCTGTTGTTA TCTCAATCAT CGTTTTGCTG GCACTTAGTT TGCTACGCAT CAACGTTATT 353SO0 
ATCGCACTGG TGATTGCTGC GCTTACCGCA GgATTTATCG GTGATTTAGG TTTAACTAAA 3530,0 
ACCATCGAAA CCTTTACAGG CGGCTTAGGT GGCGGTGCTG AAGTCGCAAT GAACTATGCA 3=3,30 
ATACTTGGAG CGTTCGCCAT TGCTATTTCT AAATCAGGAA TTACTGATTT AATCGCTTAT 3S3 98 „ 
AAAATGATCA CAAAAATGAA TAAAACACCA ACTGCTGGCA ATTTAACTTG GTTTAAATAT 3530,0 
TTCATTTTTG CGGTATTGGC ATTGTTTGCC ATCTCCTCTC AAAACTTACT TCCAGTGCAT 353300 
ATTGCTTTTA TTCCAATCGT TGTACCACCA CTACTTTCAA TCTTTAACCG TTTAAAAATT 3533,0 
GACCGCCGTG CAGTAGCTTG CGTACTTACA TTTGGTTTAA CTGCTACCTA TATATTATTA 353330 
CCAGTAGGCT TTCGAAAAAT CTTTATTGAA AGTATTTTAG TTAAAAACAT TAATCAAGCT 353300 
GGCGCAACCT TAGGTTTACA AACAAATGTA GCACAAGTTT CTTTAGCAAT GTTGTTACCT 353340 
GTTATTGGAA TGATTTTAGG CTTACTCACA GCCATATTTA TTACCTACCG TAAACCACGT 353400 
GAATACAATA TCAATGTTGA AGAGGCAAGA ACTAAAGATA TTGAAGCTCA CATTGCTAAT 3534,0 
ATTAAACCAA AACAAATTGT TGCAAGTTTA ATCGCGATTG TCGCGACCTT CGCAACACAA 3535=0 
TTAGTCACAA GTTCAACTAT CATTGGCGGT TTAATTGGCT TAATTATTTT CGTACTATGT 353500 
GGCATTTTCA AACTAAAAGA AAGCAACGAT ATTTTTCAAC AAGGCTTGCG CTTAATGGCA 353540 
ATGATTGGCT TTGTAATGAT TGCTGCTTCA GGTTTTGCAA ATGTGATTAA TGCCACCACT 353,00 
GGCGTAACAG ATTTAGTACA AAGCCTAAGT TCTGGTGTTG TGCAAAGTAA AGGTATTGCC 3537,0 
GCATTATTAA TGCTTGTCGT TGGTTTATTA ATTACAATGG GTATTGGCTC ATCATTCTCT 353030 
ACAGTACCGA TTATTACATC AATTTACGTA CCACTCTGTT TATCTTTTGG TTTTTCTCCA 353000 
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TTAGCGACAA TTTCTATTGT AGGCGTAGCA GCGGCTTTAG GAGATGCAGG TTCGCCCGCA 3 5394 0 

TCAGACTCAA CATTAGGCCC AACATCTGGC TTGAATATGG ATGGCAAACA CGATCACATT 3 54000 

TGGGATTCTG TTGTACCTAC CTTTATTCAC TATAATATTC CATTATTGGT ATTCGGCTGG 3 54 060 

ATTGCAGCAA TGTACCTTTA ACGATTATAA TCCCCCACTT GGGGGATTTT TTTATGACTG 3 54120 

TTCAACAACT TATCCAACGC TT AG AT C AAA AAGTTCAACA ACTTTATCAA GCTCATTTAT 3 54180 

CCAAACGAGA AGAAAAAATC TTCGCCAAAT TTGACCGCAC TTTGTTCAGT GAAAATGGAC 3 5424 0 

AAAATGTTTC CTTTTATCTA AAAGAAATCA ATCAAACGCT AGATAGAATA AAGACATTAG 3 54 300 

AGTCTAACGA TTCTAATCAC TATAATTTTC TAG CTG AG CG TTTACTCGCC AATGTTCCGT 3 54360 

TCTTTCGGAA GCTTTAGTTC G C AAAAAT AC TCATCTAACT GAATCCCAAA CAACAACGAA 3 544 2 0 

ACAAACCATT CAAAAATCTC AACAT AG CAT TCATAAATTA CCAC CAAGAG AACGCCTAGA 3 544 8 0 

AAAATATTAC GAAGCACGAG AACAATTGAA TAATCTTTAT CGACAGCATA AAGATTTAGC 3 54 54 0 

ACAAG CTGAA AAAAATAATG ATGAGaAAAT ACGTTATGCT CAACTTGCAG AAGTTTATAA 3 54 600 

AAAACGCCAG CAAAAATGCC AAGATGCAAT CGATCTTTTA GAAGAATATT TGGTGTTTAA 3 5466 0 

AGAAGAAGTG GAAAACCGTG AAAATACGGA AAACAAATGA TTAGAAATAA AAAACTAATT 3 54 72 0 

CTTCCGAATT AGTTAGCTCG TAAAAATTGA TGGCGGAGGA AT AGGG ATT C GAACCCTAGG 3 54 78 0 

AGGGCGTAAA CCCTCGCCGG TTTTCAAGAC CGGTGCCTTC AACCACTCGA CCATTCCTCC 3 54 84 0 

GTGATTTAAC GAGCGTGAAT AATACGTTCT CTAATAAAAA CCGTCAAGTA GAAATAATCT 3 54 90 0 

CATTTTTATC AAGTGATTAT TTTTACCACA GCTTATAAAA AATGCCTTTC CTATTTATGA 3 54 96 0 

ACGGAAAGGC ATTTTTGTTG AATGAATTTC AGTGCTTTTT ATTACTACAA TATATCGCAG 3 5 502 0 

AAATTAACCG CACTTTTGTC ATCTAAACAA TTATTTTACG CGAGAGACGT ATTCGCCAGA 3 5 5080 

GCGGGTATCA ACTCTGATAA CTTCGCCGAT TTGGACGAAA AGCGGCACTT TTACAACTGC 3 5 514 0 

GCCTGTGCTT AATGTTGCAG GTTTACCGCC AGTGCCTGCG GTATCGCCTT TAAGAC CTGG 3 5 5200 

GTCTGTATCG ACGATTTCTA ATTCAACAAA GTTTGGTGGC GTAACAGTAA TTGGTGCGCC 3 5526 0 

ATTC CAT AAA GTCACGATAC AATCTGCTTG GTCTAATAAC C ATTTTT CTG CATCGCCTAC 3 5 532 0 

GGCTTTTGCA TCTGCAGAGT ACTGTTCAAA TGTTTCTGGG TGCATAAAGT ACCAGAATGC 3 5 5380 

ATCGTCTTTG TATGAATAAG TTAAGTTAAG A t C CATAAC A TCAGCTGCTT CAACCGAAGT 3 5 5440 

GCCAGATTTA AAGTTTACGT CTAATACTTT GCCTGAAATT AATTTACGAA TACGAGTACG 3 5 5500 

AGTGAATGCC TGACCTTTGC CTGGTTTTAC AAATTCATTT TCAACGATGA CACAAGGCTC 3 55 56 0 
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GCCGTCTTGC ATAAATTTTA GACCTGGTTT GAAATCACTG GTAGTATATG TAGCCATATT 35 5620 
TGAAATATCC TAAAAATGAG AGATGGTTTG AAAAGTGCGT ATTTTACC CC AAGAACCCGT 355680 
CATTAGAGAA GAACAAAATT GGCTCACAAT TCTAAAAAAT GCCATTTCAG ATC CTAAATT 355740 
ATTACTAAAA GCCTTAAATT TACCAGAAGA TGATTTTGAG CAATC CATTG CTGCGCGGAA 355800 
ACTTTTTTCG CTCCGCGTGC CACAACCTTT CATTGATAAA ATAGAAAAAG GTAATCCGCA 35S860 
AGATCCCCTT TTCTTGCAAG TGATGTGTTC TGATTTAGAG TTTGTGCAAG CGGAGGGATT 355920 
TAGTACGGAT CCCTTAGAAG AAAAAAATGC CAATGCGGTG CCAAATATTC TTCATAAATA 355980 
TAGAAATCGC TTGCTCTTTA TGGCAAAAGG CGGTTGTGCG GTGAATTGTC GTTATTGCTT 356040 
TCGCCGACAT TTTCCTTACG ATGAAAACCC AGGAAATAAA AAAAGCTGGC AACTGGCGTT 356100 
AGATTACATT GCGGCACATT CTGAAATAGA AGAAGTGATT TTTTCAGGTG GCGATCCTTT 3 56X60 
AATGGCGAAA GATCACGAAT TAGCGTGGTT AATAAAACAT TTGGAAAATA TACCGCACTT 3 56220 
ACAACGTTTG CGTATTCACA CCCGTTTGCC TGTTGTGATT CCGCAACGGA TTACTGATGA 356280 
ATTTTG CACT TTATTAGCAG AAACTCGTTT GCAAACAGTT ATGGTGACAC ACATTAATCA 356340 
CCCGAATGAA ATTGATCAAA TTTTTG CTCA TGCGATGCAA AAATTAAACG CCGTGAATGT 356400 
CACGCTTTTG AATCAATCTG TTTTGCTAAA AGGCGTGAAT GATGATGCGC AAATTCTAAA 356460 
AATATTGAGC GATAAACTTT TTCAAACAGG CATTTTGCCT TATTACTTGC ATTTGCTGGA 356520 
TAAAGTTCAA GGGGCGAGCC ATTTTTTGAT TAGCGATATT G AAG CTATGC AAATCTATAA 356580 
AACCTTGCAA TCTCTGACTT CTGGCTATCT TGTTCCTAAA CTTGCACGAG AAATTGCGGG 356640 
CGAGCCAAAT AAGACTTTAT ACGCAGAATA AGATCCGATA AAATACACAT AATTTTTTCG 3 56700 
CCGCACTTTT GAGCTTCTCA ATTTTGTGGC TAATTAATAT AAATAAGAAC AAGCATCAGA 356760 
GGCAATCCAG TGAATTCTAT GGATAAAAAT CaACAATCAT CTCAAAATGA ATTGGATTTA 356820 
GGGCTTAATC AAG AG CC AAT TACACCAAAG AAAACAATTC AGCCAAGTTC GTCAATTTTA 356880 
GGTAAAGCAA AAGGGTTATT TGCCAAAAAA AATCACGTGC AAACTAACTT TCAGC AACG T 356940 
AAAGAACCTA CTTTTGGCGA TTCATCAACG CAAGAAAATG ATCCTTTAAT TCCGAGTGAA 357000 
AATTTGAAAA AAGTG CAAAA GCCTGTTCTT CAAACTTCTT CCACAGAAGA AAATATTTCT 357060 
GCGGTTGATG AAGAAATCAG TGCTGAAAAT AACGCGGATG AACCCGTCGA AAAGGCTGAA 3 57120 
AAACCTATTT TAGCTCAACC AGAAAAATGG AAAATATTAC AAGTATTGCC AGCAAAACAT 3 57180 
CGCCGTTTAT TTATGG CTAT TTTTGTGTTG GTTATTTTAT TGATTATTTT CTTCGCATTA 357240 
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AAACCAAGTT CTGACACCGT TGAGTCTTTT ACTCAATCTA ACAGCAATGA AGTTCCTGTG 3 S 73 00 
CAGTTCCAAT CGTTAGATCA AAGTCAGCCA CTGGAAACCA CGATTTTAGA TAATCCTCCT 3 57360 
GCACAAAATC AAATGGCTGT AGAACAAGCT AACCAATCTG AATTTGCACC AAAAGCAGAG 3 5742 0 
GAAGCGGCGA ATAATACGAC GGCTCAAAAC CCATTAGTAG AAAATGCGCC AATGCAACAA 3 574 80 
AATGTTGTTC AATCTCCAAG TCAAATGCCA AATGAAATGG CTGCGGCATC TGTTGCGCCT 3 57540 
ATGCAACCAG CTCAAGCGGA ACAGCCAAAA GCAACTGTGC CAGTTCAGCC GATGAAAAAA 3 57600 
GCGGTAGAGC CACAAGTTGC GCATAAAGAT ACAGTGAAAA AAGAAGTGAA AGTGG CGGAG 3 5 7660 
AAAGCACAGG CTCCAGCAAA AGCAACGGAA CAAAACGTCG CTAAAACAGC AGGAAACGCA 3 57720 
CCGATTGTTG AAGCCAAACC TGTTCAAGCT AAAAAAGAAA AGAAAGTTCA AATCGTTGAT 3 57780 
GCAAAACCTG TGAGTAAATC TACAGCTTCT CGCCTTTCAG CAAAAACATT AACTGTGCCG 3 57840 
AAAGGTGTTT CGCTGATGCA ACTCTTCCGT GATAATCAAC TTAATATTTC CGATGTAAAC 3 57 900 
GCAATGAGCA AAGCGACAGG GGCGGGAAAT GTTTTAAGTA GCTTTAAATC TGGCGATAAA 357960 
GTAACGGTAT CTGTGAATAA TCAAGGGCGA GTAAATGAAA TGCGTTTATC C AATGGTG CG 3 58020 
CGTTTTGTGC GTCAGTCTGA TGGTT CTT AT CAATATAAAA AATAATTTTT TGGGCGAGTA 3 58080 
ATCGCCCCTT TTTCTTTTTA AGTTCTTTTC TCAAGGTAGG ATTTATGTTG AAAAAAACAT 3 58140 
CTCTTATTTT TACCGCACTT TTAATGACTG GCTGTGTGCA AAA t G CGAAT GTAACAACAC 3 5820 0 
CTCAAGCGCA AAAAATGCAA GTAGAAAAaG TGGATAAAGC CTTACAAAAA GGCGAAGcTG 3 5826 0 
nTCGATATTT ATGTCAAGAT GATAGAGTTG TTCGTGTTGT ACACGCCACG CATAAAAAAT 3 58 32 0 
ACAAAAAAAA TTTGCATTAT GTTACTGTCA CTTTTCAAGG CGTATCAGAA AAACTAACCT 3 58 38 0 
TAATGATTTC TGAACGTGGT AAAAATTACG CCAATATTCG TTGGATGTGG CAAGAGCGTG 3 5844 0 
ATGATTTTAG TACGCTAAAA ACGAATCTCG G CG AAATTTT AGCAACGCAA TGTGTCTCAC 358500 
AAACAAGTGA ACGCTTATCT GGACAATAAT CGTGCAAAGC GAACTTCAAC GTGGTTTTGT 358560 
TTTGCATCGT CGCCCTTATA GTGAAACGAG CTTATTGGTA GATTTATTCA CTGAAGAAAG 3 58 62 0 
TGGGCGTTTA ACCGTGATTG CAAAAGGGGC GCGGGCAAAG CGTTCATCTT GGAAATCGGT 35868 0 
TTTGCAACCT TTCACACCTT TATTGTTGCG TTGGACGGGA AAAAGCACAT TGAAAACCCT 3 58 74 0 
CACTAAAG CT GAACCTGCCG CAATTACACT GCCTTTACAA CAAATCGCCC TTTATAGTGG 358800 
ATTTTATGTA AATGAATTAC TCACTCGAGT TATTGAGTCT GAAACGCCAA ATCCTGCACT 358860 
TTTTCAACAT TATTTAAAAT GCTTAACGGG CTTGGCAACG GAAACTAATA TTGAGCCAAC 358920 
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GTTGCGTCTT TTTGAATTTC AACTGTTACA AATTCTTGGT TACGGCGTGG ATTTTCTGCA 3S8980 
TTGTGCTGGT TCTGGCGAGC CAGTGGATTT CTCAATGACA TACCGTTATC GTGAGGAAAA 3 59040 
AGGTTTCATT GCGTCTTTAG TGAAAGATAA TCTCACGTTT TACGGCAGAG ATTTGTTAGC 359100 
CTTTGAAGCC CTTGATTTTT CCGATGATGC GGTACGTCAA GCGGCAAAAC GTTTTACTCG 35 9160 
TATTGCACTG AAACCTTATC TTGGCGATAA ACCGCTAAAA AGCCGAGAGT TATTCACTCA 3 5 92 20 
AAATATTTTA CTTTTAAAAT AATGGTTCTT CTTTATACCC CAAAGCAAAA AACAAAAAAT 3 59280 
GTTCAAACCA TTACGGCTGA TATTCTTGAT TTAGATTATC AAGGTTTAGG CGTGGCGAAA 3 5 9340 
ATTAATGGTA AAACTTGGTT TATTGAAAAT GCGCTTCCGC ACGAAAAAGT GGAATGTCGT 3 5 9400 
ATTTTGGAAG ATAAACG CCA ATATGGACAC GCCATTGTAA AAAAATGGCG TGTAAAAAGC 359460 
CCTGAACGTT TAGAACCTAA ATGTGCTCAT TTTATGCGTT GTGGTGG TTG CCAAGGACAG 3 5 9520 
CATATTCCTA TTGAAATGCA GAGAAAAGCG AAAGAATCTG CATTGTTTAA AAGATTAAGT 3 5 9580 
AAATTACAGT CAGAACCGAT TTCATTTCAA CCTATGATTT GTGGCGATGC TTGGGCGTAT 3 5 9640 
CGTCGTCGTG TGCGTTTGAG TTTATGGTTT AACCCAAGCA CAAAACAAAT CGAAATGGGC 3 5 9700 
TTTCGTCAGA AAAATACCAA CGATTTAATA CCCGTTCAAT CTTGCGAAGT AGCAGAACCT 3 5 976 0 
GCAATTAATT ATTTACTGCC TAAATTAACC GCACTTTTAG AAAAATTTTC AGCACCGAAA 3 5 9820 
CAGCTTGGGC ATATTGAATT GGTGGCTGCC GATAATGGTG TCGCAATGTT GTTACGTTAC 3 5 9880 
AC CAAAAACC TTGCGGAAAT TGACCGCACT TTATTGCTTA AATTTGCTGA ACAAGAAAAG 3 5994 0 
TTAATGCTGT TTTTACAAAG CGACAAAGGC ATTGAACAAA TCTATGGCGA TGCACCTTAT 360000 
TATCAATTTT CTGATGGCAT AAAATTACAT TTTGATATTC GTGATTTTAT TCAAGTAAAT 3 6006 0 
AGTGCATTAA ATGAAC CT AT GGTCAATACA GCGTTGGATT GGCTTGAATT GAGCCAACAG 360120 
GATTGTGTGT TAGATTTATT TTGCGGCATG GGAAATTTTA CGCTTCCTCT CGCCAAACGC 360180 
GTGAAAAGTG CGGTAGGAAT TGAAGGTGTT TTTGAAATGG TACAAAAAGC AGCGCAAAAT 3 6024 0 
GCCGAGAGAA ACCAAATCAA AAATATTGAA TTTTTCCAAG CGGATCTTGA CCAATCCTTT 3 60300 
GTTGAACAAC CTTGGGCTAA TCAGTCGTTC AATAAAATTT TGCTTGATCC TCCACGCAGT 36 036 0 
GGCGCAGCTT TCGCGTTAAA TGCGTTGTGC GAGTTGAAAG CAGAGnAAAT TTTATATGTG 36042 0 
TCTTGTAATC CCGCAACATT AGTGCGCGAT GCGGAAATTT TATGTAATTT TGGCTACAAG 3 60480 
ATTGAAAAAA GTGCGGTGAT AGATATGTTC CCTCATACGG GGCATTTGGA ATCGATAACC 3 6 0540 
TTATTCACTA CAAAATAACT CGGTTATTCG GTACAATATC CACCCAAAAT AAAAGCGACA 3 6 0600 
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GGAAGTTGCT TGCTCTTCAT CAACAAGGGG GTATTATGGT TGCTGTTCGT GGTTCTCATT 360660 
TGTTAAATCC GCAGGATTTT GTGATTGAAC AATGGTGCAC TGG CTTAAAA CTAGCAGaAC 360720 
AAACAGAAAA AAGTTTGAnC GATGCGTGGT ATTACGCACG TGATTTAATG AATGCCTACC 3 6078 0 
CCGATGAAAT GAAGAATGCG ACCTTGATGC TGCAGTCTGG CGTGGAAATG GTAGAAATTT 360840 
TGCACGAACT TAATATGGAT GCGGAAACAT TGCTTACTGC CATGTTGTTT CCTATTGTTG 360900 
CAAATAAATT AACAGACTGG GAAAGTCTAA AAGAAAAATT TGGTGCGAAA ATTACTAAAT 360960 
TACTTAAAGG CGTGTTGGAG ATGGATAATA TTCGCCAACT TAATGCCAGT CATTCCGCTA 3 61020 
ACGCTCTGCA AGTGGATAAT GTTCGCCGTA TGTTGTTGGC GATGGTGGAC GATTTCCGCT 36X080 
GCGTGATCAT TAAACTTGCC GAACGTATTA CTTTCTTGCG TGATGCAGAG CATCGTTGTG 3 61140 
CAGAGGAAGA CAAAGTCCTT GCCGTCAAAG AATGTTCTTA TATTTATGCC CCTCTTGCCA 361200 
ATCGTCTTGG AATTGGTCAG CTGAAATGGG AGTTAGAAGA TTACTG CTTC CGTTATCTTC 3 612 60 
ATCCTGAACA ATATCGCGCT ATTGCTAAAT TATTGCAGGA ACGTCGTCTT GATCGTGAAC 3 613 20 
ATTACATTGC TGATTTCGTC AGTGAGTTGA GCGGTTATTT ACGTGAAAAT ATCGAGCAAG 3 613 80 
TGGAAGTCTA TGGTCGTCCG AAACATATTT ACAGCATTTG G CGTAAAATG CAGAAAAAGC 361440 
ATCTTGAATT CAGCGGTTTA TATGACGTAA GAGCGGTCAG AATTATCGTG CAAAAATTAC 361500 
AAGATTGTTA CACCGCACTT GGGATTGTTC ATACTCAATT TAAACACTTA CCCAAAGAAT 361560 
TTGACGATTA TGTGGCGAAC CCGAAACCGA ACGGTTATCA ATCTATTCAT ACTGTTGTTT 361620 
TAGGTAAAGG TGGCAAGCCA ATTGAAGTGC AAATTCGAAC CCAACAAATG CATGATGATG 36168 0 
CAGAGTTAGG TATGGCGGCA CACTGGAAAT ATAAAGAAGG CAATACAGGC AGTATGTCTG 3 617 4 0 
CCTATGAAGA AAAAATTGCG TGGTTGCGTA AATTGTTAGC TTGGCAAGAT GATATTACGG 3 618 00 
ATTCAGG CG A AGTG CTGGC A GAGTTGCGTA nCAAGTGTTT GATGATCGTg TATATGTGTT 36186 0 
TACGC CAAAG GGCGAAGTGG TGGATTTGCC AACGGGATCT ACACCATTAG ATTTTGCTTA 36192 0 
TGCCATTCAT AGTGAAATTG GTCATCGCTG TATCGGGGCA AAAGTGGCGG GGCGTATTGT 361980 
GCCATTTACC TATCAGCTAC AAATGGGCGA TCAAATTGAT ATTATTACCC AGAAAAATC C 36 2 04 0 
GAACCCGAGC CGTGACTGGC TAAACCCAAA TTTAGGATTT ACTCACACCG CAAAATCTCG 362100 
AGCCAAAATT CAAGCGTGGT TTAAAAAACA AGATCGCGAT AAAAATATTC CAGCGGGTAA 36 216 0 
AGAATTACTC GATAATGAAC TCGCGCTTTT GAATTTAAGT ATTAAGCAAG TGGAACCTTA 36222 0 
TGCGTTACCA CGCTATAACT TGAAAAATTT GGACGATCTT TATG CGGGCA TTGG CAGTGG 3622 80 
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TGATATTCGT TTAAATAACT TAATCCATTT CTTGCAAAGT AAATTGATTA AAGTGACTGC 36 2 34 0 

TGAAGAGGCT GAT CAAG AAA TTTTACGTCA TGTTGCTAAC AAAAGTGCGG TCAATTCTCA 3624 00 

GCAGAAATCT GAAAAGAACA ATGGCTGTGT GATTGTGGAG GGTGTGGACA ATCTTATGCA 362460 

TCATATTGCG CGTTGCTGCC AGCCGATACC AGGCGATGCG ATTGTTGGCT ATATTACAAT 3 6252 0 

GGGACGTGGT ATTTCTATTC ACCGAGCGGA TTGTGAACAA TTCTTAGATT TGCAAGCGGC 3 62 580 

TCATCCAGAA CGCGTAGTGG AATCAATTTG GGGCGAAAAT TACG CAAGTG GTTTCCATAT 3 62640 

TAATATCCGT ATTGTGGCAG GCGATCGCAA TGGCTTATTG CGTGATATTA CAACGGTTCT 3 62 700 

AGCAAATGAA AAAATTAGCG TATTAGGGGT CTCAAGCCGT GCTGATACCA AAAAACAGCT 362 760 

TGCTACTATT GATATGGAAA TTGAACTTCA TAATGTTGAA AGTTTGAGTA AAATATTGGC 3 62 82 0 

TCGATTAGCA AAATTAGATG ATGTTATCGA AGCGAAGCGT TTATAAAAAT AGGAAAAwTT 362 880 

ATGTATAAAA CGACGGGGCT AACCCATTTA ATCAACTCCA CTAAATATTC ATTACAAGGC 3 62 94 0 

TTAAAAAGTG CGTTTAAAAA TGAAACGGCT TTTCGCCACG AATGTTTTTT AGCCTGCATT 3 63000 

TTAATTCCTC TTACTTTCTT TTTAGGCGAA ACAAAAATTG AAATTATATT GATGATTTCT 3 63 060 

TCTGTTTTGC ttgxgatgqc QTTAGAACTT TTAAACAGTG CGGTAGAGAC AGTTGTTGAT 3 6312 0 

CGCATTGGTA CAGAACGCCA CGAACTTTCA GGACGTGCGA AAGATCAAGG TTCTGCATCG 3 63180 

GTATTTATTG CCCTTTGTAT TGTCGGTATC GTTTGGGGCG GAATTTTATT CTTCTAACCC 3 63 240 

TCCAAAAGTG CGGTAAGTTT TTACCGCATT TTTGATTTAT CCAATCACGT CAAAATCATT 363 300 

TATAAAAATA AGGAAAAATA TGACCGCACT TTTAAAAATT GGTCTGGTGT CTGTTTCTGA 3 63 36 0 

TCGCGCATCA GCAGGCGTGT ATCAAGATCA AGGTATCCCA GAATTACAGG CTTGGTTAGA 3 6342 0 

ACAAGCGTTA GTAGATCCTT TCCATTTAGA AACAAGATTA ATCCCCGATG AACAGCCAGT 363480 

TATTGAACAA ACCTTAAAAG AGCTGGTGGA TGAaCAAGGT TGCCATTTAG TGCTGACAAC 3 63 54 0 

TGGTGGAACT GGGCCAGCAA AACGCGATGT AACGCCTGAT GCAACCCTTG CGGTAGCGGA 3 6 3600 

TCGAGAAATG CCAGGTTTTG GCGAACAAAT GCGTCAAGTG AGTTTGCATT TTGTGCCTAC 3 6 366 0 

CGCAATTTTA TCTCGCCAAG TGGGTGTAAT TCGTAAGGAA AGCCTGATTT TAAATCTTCC 36 3 720 

TGGGCAACCT AAAGCGATTA AAGAAACCTT AGAGGGCGTA AAAGATAAAG AAGGAAATGT 363780 

GCTTGTAAAA GGCATTTTTA GCGCAGTGCC TTATTGTTTG CAACTGATTA ACGGCTTATA 36384 0 

TATTGATACC AAGC CTG AAA TCATCGAAAG TTTCCGTCCT AAATCTGCAA GACGCGAAAA 36390 0 

TCTGGAGAAA TAGCATGAAA AAAATCGAAG CAATGATTAA ACCCTTTAAA TTAGACGATG 36 3 960 
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TGCGAGAAAG TCTTTCAGAT ATTGGTATTT CAGGTATGAC AATCACAGAA GTACGCGGAT 364020 
TTGGTCGTCA AAAAGGTCAT ACAGAACTTT ATCGTGGTGC GGAATATATG GTGGATTTTC 364080 
TGCCGAAAGT GAAATTGGAA GTGGTAGTtC CTGATGAGCT TGTGGATCAA TGTATTGAAG 364140 
CGATTATTGA AACGGCACAA ACAGGTAAAA TCGGTGACGG CAAAATTTTT GTTTATCACG 364200 
TTGAGAGAGC CATCCGCATT CGCACGGGCG AAGAAAACGA GGATGCGATT TAGTGCAAAA 364260 
TAATTTAAAT TTACACCGCA CTTTGCTTGG CATTGCTGCG GTGATTATTA TCTTAGCTGG 364320 
CGTAAAATTG GCGG C AG AAA TCGTGGTGCC ATTCTTGCTA TCACTCTTTA TCGCGATAAT 364380 
TTGTTCGCCG ATAATTAAGG CGATGACCCA ACGCCGCGTC CCGCATTGGT TAGCCATTAC 364440 
TTTGCTTTTT GTCTTGATTT CTTTGGTATT TTTCTTCTTA GTCGGGTTGA TTAACAGCAC 364500 
GGCAAGAGAA TTTACCCAAT CTATTCCACA ATATAAAGTT TTGCTTTCAC AACGCGTGAG 364560 
TGATTTAACT GGATTGTTGC AACGTTTCAA TCTGCCTTTC AcgCTTTCGC GTGAAACTAT 364620 
CCAAGAAAAT TTTGACCCAA GCATCATTAT GAATTTTGTA AGCCGAGTAT TGCTGAATTT 364680 
CTCTGGTGTA GTGAGTAATG TGTTTGTTTT AGTGTTGGTC GTGATTTTTA TGCTTGCCGA 364740 
AGCACCAACG ATGAAACATA AATTTGCAAT GGTGATTAGT TCAACCCCTC ATGACGTAGC 364800 
CAAAGAAGAA CGCCATATTG ATCGCGTTTT ACAAGGTGTA ATTG GTTATC TTGGCATCAA 364860 
AAGCATCACG AGTTTATTGA CTGGTGTTGG AGTGTTTATT TTGTTGGAAG CCTGTAGGGT 364920 
TCAATATGCG ATTTTGTGGG CAACCTTGAG TTTCTTGCTG AATTATATTC CCAATATTGG 364980 
TTCTATCATC GCGGCTATTC CGATTATTGT GCAAGCCTTA TTACTGAATG GTTTTGGCAT 365040 
TGGCTTTGGC GTGGCAATCG GTGTTATTGC GATTAATATG GTTGTCGGTA ACATTATTGA 365X00 
ACCTAAAATG ATGGGGCAAC GATTAGGGCT TTCAACTTTA GTGGTGTTTC TTTCATTACT 365160 
GTTTTGGGGA TGGCTGCTTG GAACCGTGGG AATGCTACTG TCTGTTCCTT TGACGATGGC 365220 
ATTAAAAATT GCCTTGGAGT CCAGTCCCAA TACAGCGAAA TATGCCTGTC TATTAGGAGA 365280 
TGTCGAAGAT TTCAAATAGA TTTTAGAGAT AAAAGTGCGG TGAATTTTCA CCGCATTTTT 365340 
TATGATAAGT CTTGCGGATC GACGTCCAAA ATTAACCGCA CTTGGCTGTT TTTTTCTAGT 365400 
TCGGCTTGTT GATATTCTCT TAATGCTTTT TGCAACGTCA TTCTTGATGG ATGCTGCAAT 365460 
AATAACTGCC AGCGATATTG CCCCGCTTTT TTGCTGAACG GTGCAGGCAT TGGGCCAAGC 365520 
ATTTGTAATC CCGTAATTTG CTTCGATTGA AAAAAATCTG CAATTTGGCT TAAGCAATTT 365580 
TCTGCAAGAT CAGAATGTCG AGCCTGTGCT TTGATTAACG C CTG AAAAGT AAAAGGCGGC 36S640 
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AATCCCATTG 


AATGGCGTAA 


TTGTAAGGTT 


TCTTTTGCAA 


AAGCTTGGTA 


. GCCATTTGCC 


365700 


AATAAAGTGG 


TAAGCAAAGG 


ATGATCTGGA 


TAATGTGTTT 


GTAGCACCAC 


TTCACCTTGT 


365760 


TTATCGGCAC 


GTCCTGCACG 


TCCAGCAACC 


TGAATATAAA 


GTTGAGCTAA 


ACGTTCTTCT 


365820 


GCACGAAAAT 


CCAGTGAAAA 


TAATGCACTA 


TCTACATTCA 


CTAAGGCAAC 


GAGAGTAACA 


365880 


TTCGGGAAAT 


GATGTCCTTT 


CGCAAGCATT 


TGTGTGCCAA 


TCAAAATTTG 


ACTTTTGCCT 


365940 


TGTTGAATAT 


CTTCTAAATA 


ACCTTCCAAT 


TTTCCTTTAC 


GCGAAGTGCT 


ATCGCGGTCG 


366000 


ATTCGAGCCA 


CAGAATAATG 


AGGAAATAGA 


GTTTTCAAGG 


TTTCTTCTAA 


TTGTTCTGTG 


366060 


CCGAGACCTG 


TAGTGACTAA 


ATGAGTCGAG 


CCACAATCAC 


CACATTGTCG 


AGGAATAGTT 


366120 


TTCTGTGCAC 


CACAATGATG 


ACAACGCAAT 


ACATTTTGAT 


GCTGATGATA 


GGTATAAGGT 


366180 


TTTTCACAAT 


GCGGGCATTG 


TGCGAtCCAA 


C C AC ATT CAT 


GGCAAAG CAA 


TACGGGCGCA 


366240 


AAGCCACGAC 


GATTCAAGAA 


AAGCAACACT 


TGATTGCCTT 


TTTCGAGGTG 


GGCTTTCATT 


366300 


CGTTCCAACA 


AGGGTTTTGA 


AAGCCCGTTT 


TGTATATTTT 


GATTTTTCAA 


ATCAATAACG 


366360 


AAATGACGAA 


GTGCGGTAGA 


ATTTCCCGCT 


CTTTTTGATA 


AAACTAAATG 


TTGGTATTTG 


366420 


CCGTTTTGTA 


CATTATTAAT 


ACTTTCTAAA 


CTTGGCGTAG 


CAGAACCCAT 


TAATACTGAA 


366480 


ATGTTCAGTT 


TTTGAGCCAG 


TACAATCGCC 


AAGTCTCGCG 


CGTGATAACG 


CCAACTGTCT 


366540 


TGCTGTTTGT 


AAGACGAATC 


GTGTTCCTCG 


TCCAAAATAA 


TTGCACCAAG 


ATTAGAAAAT 


366600 


TGCGTGAACA 


ATGCCGATCT 


AGTGCCAATC 


ACAATAGCAC 


TTTGTCCAGA 


GCGGGCGCGA 


366660 


TCCCATACAT 


AAAGCCGTTG 


CGTATCGGTT 


AAATTAGAAT 


GCAGCACATC 


AATTTCTACA 


366720 


TTGAAACGCA 


CTTTGAAACG 


TTGCACCGTT 


TGTGGTGTTA 


GCCCGATTTC 


GGGTACAAGC 


366780 


ACTAATACTT 


GTTTGCCCGA 


TTTTAAAATT 


TCTTCAATAT 


ATTG CAG AT A 


AATTTCTGTT 


366840 


TTGCCTGAAC 


CAGTCACGCC 


ATCGAGCAGC 


CATACATTGA 


AACCAGAATG 


GAAGAGCAGT 


366900 


TGGCTGAAGG 


CTAAGGCTTG 


CTGCTTATTT 


AAGGTTAAAC 


GATTTTCAGC 


ATTAACAATT 


366960 


GGGTTATTGC 


CTAATCTTTG 


TTGCCAGCTT 


AGGGGATTTG 


TTTGGATAGT 


AATTTCTTCA 


367020 


ATAAACCCTT 


TGGCTTTTAA 


GGCTGACCAA 


ATGGCAGAGG 


AAAAATCATT 


ATTGCCTTTT 


367080 


TCAAGATCAG 


TTTCAGATAA 


GTATTGTAGG 


GCTTCAGCTT 


G CTTTTTCG A 


GCGTTTTAAT 


3 6 714 0 


TCGCCTTGTT 


TAAGGGCATT 


TTTCCCCGCA 


TCCGTAATTC 


GCCAAAAAGT 


GCGGTCATTT 


367200 


TTTACCGCAC 


TTTCGCCGTT 


ACGCAATTTC 


ACAGGTAAGG 


CTTGAAATAA 


CACATCGCCT 


367260 


AATCCCGCTT 


GATAATAATT 


TGCTGCCCAA 


TGAAGCCAAT 


CCCAATAAAT 


GGGCGTAAAA 


367320 
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AGTGGGGCTA AATCCAATGG TTGAAGAATA G C CTTGAGTT TATCCTCTGG CACGTCCGAT 367380 
TTTGTCGGAA AATCAGCCAC AATGGCGACA CGTTTTTGAG TGCCAAAAGG CACAAGAACA 367440 
CGCATACCGA TTTGTAATGA AACATCATCG GGGACGAAAT AATCAAATAA GCGAGGAAGC 367500 
GGCACAGCCA GTGCCACTCG GACAATTTTC ATAATCAACC TTGGAAAAAA TTTGCCCTAT 367560 
TATACGAGAA AAATTGGTAT AATCGCCGCC AATTTTTTCA CGAACAAGAT TTTTATGACA 367620 
CAAACTTTTG CCGATCAAAA ACGTAAAACC GTTGAAACCG CAGAATTTAC CGAAGATGGC 367680 
CGCTATAAAC GCAAAGTCCG TAGTTTTGTT CTCCGCACAG GTCGTTTGAG CGAATTTCAA 367740 
AGAAATATGA TGAATGATAA CTGGGGAACA CTTGGTTTAG ATTACCAAAC GGAACCTTTT 367800 
GATTTTGCGA AGATTTATGG CAATGATAAT CCAGTTGTAT TAGAAATTGG CTTTGGAATG 367860 
GGAAAATCGC TCGTGGATAT GGCTTTTGCT AATCCTGACA AAAATTACCT TGGCATTGAA 367920 
GTTCATACTC CTGGCGTTGG TGCTTGTATT GCTTATGCAG TGGAAAAAGG CGTAACGAAT 367980 
TTACGCGTGA TTTGTCATGA TGCAACGGAA ATTTTACGTG ACAGTATTGC AGATGGTGCG 368040 
CTTGGCGGTT TGCAGrTATT TTTCCCCGAT CCTTGGCATA AAGCAAAACA TCATAAACGT 368100 
CGTATTGTTC AACCGCACTT TGTGACACAA GTTATACAAA AATTAGGTGA AAATGGTTTT 368160 
ATCCATATGG CGACAGACTG GGAAAATTAT GCAGAGCAAA TGCTAGAAGT ATTAAGCGCA 368220 
AACACCGATT TAGTGAACAC CTCGAAAAAT GGTGATTATA TTCCAAGACC AGATTTTAGA 368280 
CCTTTAACTA AATTTGAAGC CAGAGGTTAC AAACTTGGAC ATGGTGTTTG GGATTTATAT 368340 
TTTGTGAAGA AATAAATCAG TTTTAGGCAA ACTCGATGCC TAATTGATGA ATTTAATATA 368400 
GAATATGCCT TCAAATATAA CCTTAATAGG AGAAATACAA TGTCTAAATC TTACAATCAA 368460 
CGCCAACGCA AAAAACTTCA TCTCGCGGAA TTTCAAGAAC TGGGTTTTCT TGTAAATTTC 368520 
CAATTTGCAG AAGGCACTGC GATTGAAACT GTAGATGAAA CCGTTGATCG TTTTATTAAC 368580 
GAAGTGATTC AACCAAATGG ATTAGCTTAT GAAGGTAGCG GTTATTTACA TTGGGAAGGT 368640 
TTAGTTTGCC TAGAAAAAAT TGGCAAATGC GATGAAAGCC ATCGTGAAAC TGTGAAAAAA 368700 
TGGTTAGAAA CAAATGGTTT GCAACAAATT GAAGTAAGCG AATTATTTGA TATTTGGTGG 368760 
GAATATCCAA CAAAAGTAGA ATAATCTAAA TTGAAAACAG CATAGTCAAA ATTGCTATGC 368820 
TGTTTTGTTT TAATCCCTTG CAAAATTCAT T AAAC CTTAG ATTTTTTGAT CTATAACAAA 368880 
TAAAGTTCAA ATAATTGAAA AACTACCTAT TTTGGGTTGT TTTCATTCAT TCATCGTAAA 368940 
CACATAATCC GTTGCCTACA GAAGCATTCA GAAATATTTG AAAGACATTA TGACCGTTGA 369000 
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AAATTTACCT CGCAGACAAT TTTTACGCGG TAAATTTTCA ACATTGAGCT GTTTAGAAAA 3 690 60 
TAATCAAAAA CAGAATTTTG TGGGTATTAG GCCGCCTTGG TCTGTGGAAA ATTCTATTTT 3 69120 
TGTAGCACGG TGTACGCGCT GTGGCGACTG TCTTTCTGTC TGTGAAACCA ATATTTTGGT 36.X80 
AAAGGGCGAT GCTGGTTTTC CTGAAGTGCG TTTTGATAAT GGCGAATGTA CTTTTTGTGG 3 69240 
GAAATGTGTA GATGCCTGTA AACAG CCTAT TTTTTATCCG CGTGACCAGT TACCTTGGTC 369 300 
ACATAAAATC GACATTAGCG TAGTTGTTTA ACATTGCACC GTATTGAATG CCGAACTTGC 36.360 
CAAGATAATT GTCCAGCTAA TGCGATTCGT TTCAAATTAC AAATGGGTGG CGTAGcACAA 36.4 20 
CCTTTAGTTA ATTtCGATGC TTGTAATGGC TGTGGGGCTT GTGTACAAGG CTGCCCCGTG 369480 
AATGCAATAA CAATGAATGA TCTAAAACAA AATGAATAAT ACaAATTTAa TATTAGAAAA 36.540 
TGCGAGAGAT TGGCACGTnG TTGGATTGAT CGTGCAAGGT AATCCAAAAA AATTATCAGC 36.600 
AATTC AAAC C GCACTTTTAG CGATTGAGCA TACTGAAATT CCGACTCTTG ATGAAAAACT 36.660 
TGGAAAATTT GTGGTGGTAA TGCAATCGAA CGATCAGCAT CTCTTGCTTG AAAAAATGGA 36 9720 
AAGTGTGAAA GATATCGACG GTGTGATTAA TGTATCGTTG GTGTATCACG AGCAAGATGA 36.780 
ACAAAATAAA TAGTTTTAAA ATCAAACGTT TAACGTGGGG GAAAACGCGA TGAATTTAAG 36.840 
TCGTCGAGAC TTTATGAAAG CCAATGCGGC TATGGCAGCC GCAACGGCAG CGGGGCTAAC 36..00 
CATCCCAGTC AAAAATGTGG TTGCGGCTGA ATCCGAAATT AAATGGGACA AAGCAGTATG 36..60 
TCGTTTCTGT GGTAC CGGTT GTGCAGTATT AGTTGGTACT AAAGATGGAC GTGTTGTGGC 37 0020 
ATCTCAAGGC GATCCTGATG CAGAAGTAAA CCGTGGTTTA AACTGTATTA AAGGTTATTT 370080 
CTTGCCAAAA ATTATGTACG GTAAAGACCG TTTAACGCAG CCGCTTTTAC GTATGACAAA 370X40 
CGGAAAATTT GATAAGAACG GCGATTTTnC GCCAGTTTCT TGGGATTTTG CCGTTCAAAA 370 2 00 
C AATGG CTGA AAAATTCAAA GAAGCGTTCA AAAAGAACGG TCAAAATGCA GTAGGTATGT 370 2 60 
TTAGTTCTGG TCAGTCTACC ATTTGGGAAG GCTATGCAAA GAACAAACTT TGGAAAGCAG 3703 20 
GTTTTCGTTC TAACAACGTA GACCCGAATG CGCGTCACTG TATGGCATCT G CAG CGGTTG 370380 
CGTTTATGCG CACCTTCGGT ATGGATGAAC CTATGGGTTG TTATAACGAC ATTGAACAGG 370440 
CAGATGCTTT TGTTCTTTGG GGCTCAAATA TGGCGGAAAT GCACCCAATT TTGTGGTCGC 370500 
GTATTACTGA TCGCCGTATT TCTAATCCTG ATGTTCGTGT CACTGTACTT TCTACTTACG 370560 
AACATCGTAG TTTTGAACTT GCCGATCACG GTTTGATATT TACACCGCAA ACTGATTTGG 3706 2 0 
CAATTATGAA CTACATCATC AATTATCTTA TTCAAAATAA TGCGATTAAT TGGGATTTTG 370680 
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TTAATAAACA TACCAAATTT AAACGCGGAG AAACGAATAT TGGCTATGGT TTGCGTCCAG 370740 
AGCATCCATT AGAAAAAGAC ACGAATCGTA AAACAGCTGG GAAAATG CAC GATTCTTCTT 370800 
TTGAAGAATT AAAG C AAC TT GTATCAGAAT ATACAGTGGA AAAAGTATCG AAAATGTCTG 370860 
GGTTAGATAA AGTCCAGTTA GAAACTTTAG CGAAACTTTA TGCTGATCCA ACGAAGAAAG 370920 
TGGTTTC CTA CTGGACAATG GGCTTTAACC AACATACACG TGGTGTGTGG GTAAACCAAT 370980 
TAATCTACAA TATTCATTTA CTTACTGGAA AAATTTCAAT CCCAGGTTGT GGGCCATTTT 3 71040 
CATTAACTGG TCAGCCTTCT GCTTGTGGTA CGGCGCGTGA AGTAGGTTCA TTCCCTCATC 371100 
GTTTACCTGC CGACTTAGTG GTAACTAATC CGAAACACCG TGAAATTGCT GAACGTATTT 371160 
GGAAATTACC AAAAGGTACG GTTTCTGAAA AAGTGGGGTT ACACACAATT GCACAAGACC 371220 
GTGCAATGAA TGATGGCGAA ATGAATGTGT TATGGCAAAT GTGTAACAAT AATATGCAAG 371280 
CAGGGCCAAA CATTAATCAA GAGCGTTTGC CAGGCTGGCG TAAAGAAGGC AACTTCGTGA 371340 
TTGTTTCAGA TCCTTATCCA ACTGTATCCG CACTTTCTGC TGACTTAATT CTTCCAACGG 371400 
CAATGTGGGT GGAAAAAGAA GGGGCTTACG GCAATGCAGA ACGTCGTACC CAATTCTGGC 371460 
GTCAG CAAGT CAAAGCACCG GGCGAAGCAA AATCAGACTT ATGGCAATTA ATGGAGTTCG 371520 
CAAAATACTT CACGACGGAT GAAATGTGGA CAGAAGACTT ACTTGCACAA ATGCCTGAAT 371S80 
ATAGAGGTAA AACTTTATAT GAAGTGCTTT TCAAAAATGG TCAAGTAGAT AAATTCCCAT 371640 
TGAGCGAACT TGCTGAAGGC CAATTAAACG ATGAATCAGA ATATTTTGGT TACTACGTTC 371700 
ACAAAGGTTT ATTTGAAGAA TACG CTGAAT TTGGTCGCGG TCACGGTCAC GATTTAGCTC 371760 
CATTTGATAT GTATCATAAA GCGCGTGGTT TACGTTGGCC AGTTGTGGAG GGTAAAGAAA 371820 
CCTTATGGCG TTACCGTGAA GGCTATGAtC CGTATGTTAA AGAAGGGGAA GGCGTGGCTT 371880 
TCTACGGTTA TCCTGATAAG AAAG CGATTA TTCTTGCGGT GCCTTATGAA CCACCAGCTG 371940 
AGTCTCCCGA TAATGAATAC GATTTATGGT TATCAACAGG TCGTGTTCTT GAACATTGGC 3 72000 
ATACGGGGAC AATGACACGC CGTGTACCAG AATTGCATCG AGCATTTCCA AATAATTTAG 372060 
TATGGATG C A CCCATTAGAT GCACAAGCAC GTGGTTTACG TCATGGCGAT AAAATTAAGA 372120 
TTTCATCTCG TCGAGGCGAA ATGATTTCTT ATTTAGATAC TCGCGGACGT AATAAACCAC 372180 
CTCGTGGCTT AGTATTTACC ACTTTCTTTG ATGCAGGTCA GCTTGCAAAT AGCTTAACAC 3 72240 
TAGATGCGAC AGACCCAATT TCAAAAGAAA CCGACTTCAA AAAATGTGCG GTAAAAGTGG 3 72 3 00 
AAAAGGCTG C G TAAAAACC A TGTAAAAATC AACCGCACTT TATTTTCTCC CATCAATAAA 3 723 60 
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AGAGGAAGAT GAAAGTGCGG TTAAAATCTA AAAAGAAAAT GAAGAAACCC GCACTTAATC 3 7242 0 

CTGAACGTAG AAAATTTCTT AAAGAAGCAA CCCGTACTGC GGGTGGTTTA GCTGGCGTTG 3 72480 

GGATTTTGTT GGGATTGCAG CAAAATCAAA GTCTTGCGCG CGAAGGTGTG CCATTACGCC 3 72 54 0 

CTCCATTTGC TTTGCAAGAT GCGAAAGCGT TTTCTGCCGC TTGTATTCGT TGCGGACAAT 3 72600 

GTGTTCAAGC CTGTCCTTAT GACATGCTGC ATTTAGCCTC TTTATTATCG CCAGTAGAAG 3 72660 

CGGGAACGCC GTATTTTATT GCGCGTGATA AACCTTGTGA AATGTGTCCT GATATTCCTT 3 73720 

GCGCAAAAGC CTGTCCTAGT GGTGCATTAG ATCGTCAAGC AACAGATATT AATGAATCGC 3 72 78 0 

GTATGGGGCT GTCAGTACTG TTAGATCACG AAACTTGCTT AAATTATCAA GGATTACGTT 3 72 84 0 

GTGATG TTTG TTATCGAGTT TGTCCATTAA TCGATAAAGC GATTACATTG GAAACACAAC 3 72 900 

ATAATC CACG TTCTGATAAG CATGC CTTAT TTATTCCAAC AGTACATTCT GATGCCTGCA 3 72 96 0 

CGGGATGTGG AAAATGTGAG CAAGCTTGTG TGTTAGAAGA AGCAGCGATT AAGATTTTAC 3 73 02 0 

CAATGGATCT GGCAAAAGGT ATGTTAGGTA AACATTATCG TTTAGGTTGG GAAGAAAAAG 3 73 08 0 

CGAAAG CTGG CCATTCTCTT GCACCTAAAG ATATGATTTC ATTGCCAACT CG AAC AC C AG 3 7314 0 

AAGGAACAAC TGTAATTCCT GAACCAGCAG AACCCGTGCT TGCGCCTATT TTAGGGAGTG 3 7320 0 

GCAAATAATG GCAAATGCAC CAAAATTTGC AGGGAAAGAA tCGCGCGAAA AATGGGGCTG 3 73260 

GTGGTATGCA AACCGCTTTT TGTTTTGGCG TCGATTAAGC CAACTTAGCA nTCTTGCGAT 373 320 

GTTTTTGAGT GGCCCTTATT TTGGCGTGTG GATTTTAAAA GGTAATTATA GTGGTAGTTT 3 73380 

ATTACTCGAT ACTATTCCAC TAAGTGATCC TCTAATTACC GCTGAAAGCT TGGCTGCAAG 3 73440 

ACATTTACCA GATGCTCTCA CTTTAATTGG TGCTGCAATT ATTGTGTTAT TTTACGCAGT 3 73 500 

ACTTGGTAGT AAAGTGTTTT GTGGTTGGGT ATGCCCACTA AATGTTGTGA CTGACTGTGC 3 73 56 0 

AGCGTGGCTT CGCCGTAAGT TAGGCATTCG CCAAACAGCA AAAATTTCGC GCGGTTTACG 3 73 S 20 

TTATGGTATT CTTGTGCTTA TTTTACTAGG CAGTAGTGTC AGCGGAATGT TGCTTTGGGA 3 7 3680 

GTGGGTTAAT CCAGTGGCTG CTCTTGGTCG CGCCTTTGTA TTTGGTTTTG GTGCAACAGG 3 73 740 

CTGGTTATTA CTTGTTATTT TTCTTTTTGA TTTATTGATT GCTGAACACG GCTGGTGTGG 3 738 00 

ACATCTTTGC CCAATTGGTG CTGCTTATGG TGTGATTGGT GCAAAAAGTT TAATCCGTAT 3 7 3860 

TAAAGTGATT GATCGCG CT A AATGCGATAA TTGTATGGAT TGTTACAATG TTTGTCCCGA 3 73 920 

AGCCCAAGTG TTACGTTCGC CTTTACACGG AAAAAAAGAC GAAAGCCTAC TAGTGCTTTC 3 73 980 

CAAAGATTGC ATCAGCTGCG GACGTTGTAT TGACGTTTGC GCTGAAAAAG TTTTTAAATT 3 74040 
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TTCGACCAGA 


TTCGATCATT 


CAGGGGAGTG 


ATTAATATGA 


CTAAACAGGT 


ATCTAAAATT 


374100 


T TAG CGGGAT 


TATTTACCGC 


ACTTTTTGCT 


GGCTCACTGA 


TGGCATCTGA 


TGCACCAGCA 


374160 


GTGGGAAAAG 


ATTTAACTCA 


AGCGGCAGAA 


AATATTCCAC 


CCGCATTTCA 


TAATGCACCG 


374220 


CGTCAAGGCG 


AATTACCCGC 


ATTGAACTAT 


GTAAATCAGC 


CTCCAATGGT 


TCCACATAGT 


374280 


GTGGCAAACT 


ATCAAGTGAC 


AAAAAATGTA 


AATCAATGTT 


TAAATTG CCA 


CAGCCCAGAA 


374340 


AATTCTCGTT 


TGAGCGGTGC 


AACTCGAATC 


AGCCCAACTC 


ACTTTATGGA 


TCGTGATGGA 


374400 


AAAGTAGGTT 


CAAGTTCATC 


GCCACGTCGT 


TATTTCTGTT 


TACAATGTCA 


CGTTTCTCAA 


374460 


G CAAACGT AG 


ATCCGATTGT 


TCCAAATGAT 


TTCAAACCGA 


TGAAAGGTTA 


CGGAAACTAA 


374520 


TTTAAGAGGT 


CTATATGTCT 


GAGAAAAAAC 


CAAATATCTT 


AAAGCGTTTC 


TGGCAATGGT 


374580 


TTAGAAAACC 


AAGCCGTATG 


GCGATAGGTA 


CAATTATTAT 


CTTATCGGCT 


ATTGG CGGTA 


374640 


TTCTTTCTTG 


GGTTGGTTTT 


AACTATGGTT 


TAGAAAAAAC 


CAACACTGAA 


CAGTTCTGTG 


374700 


CAAGCTGCCA 


TATGCAAGAT 


GCTTATCCAG 


AATATTTACA 


TAGCGTGCAT 


TATCAAACTC 


374760 


GTACTGGTGT 


GGGGGCAAGT 


TGTCCAGATT 


GTCACGTACC 


ACACGAATTT 


GGTGCAAAAA 


374820 


TGAAACGTAA 


AATTATTGCA 


GCTAAAGAAG 


TATATGCGCA 


CTATACTGGA 


AAAGTGGATA 


374880 


CCTTAGAAAA 


ATTCAATGCA 


CATCGTTTAG 


AAATGGCTCA 


AAATGAATGG 


GCGCGTATGA 


374940 


AAG CTAACGA 


TTCTAAAGAA 


TGTCGTAATT 


G C CAT AATGT 


AGATCGTATG 


ACCTTCAATG 


375000 


ATCAACGTTC 


TGTGGCGGCA 


CGTATGCACC 


AAAAAATGAA 


AACTGAAGGT 


AAAACTTGTA 


375060 


TCGACTGTCA 


CAAAGGTATT 


GCACACCAAT 


TACCTGATAT 


GAGTGGCGTT 


GAGTCAGGCT 


375120 


TCAAAGACGA 


GAAGTGATTT 


TCTAAAAATA 


GGCTGAAAAA 


ATATCGCACT 


TTTTTCAGTC 


375180 


TATTATAAAT 


TTGGGTAGTT 


TATAAAAAAA 


TCTGCACTTT 


AATTCGCTGA 


TTTATTCAGC 


375240 


CTTTAAGGTG 


CAGATTAATT 


ATATTTTCTA 


AAGTGCGGTT 


AATTTTTAGA 


TTATTTTTAG 


375300 


CCTAAGATTT 


TATCTAACTC 


TTGGCTAACT 


TCTTCCACTT 


TTTGTGTACC 


GTCTAAACGG 


375360 


AAATATTGAG 


TATTCCCCGC 


TTTCGCTTCT 


GCTTGGTAAT 


AATCAATTAA 


TGGGCTAGTT 


375420 


TGTTTATGAT 


ATACGGCTAA 


ACGATCTAAT 


ACAGTTTCTG 


GTTTATCGTC 


TGCACGAATA 


375480 


ATTAAATCTT 


CGCCTGTTAC 


ATCATCCTTA 


CCTTCCACTT 


TTGGTGGATT 


ATAAACGATA 


375540 


TGGTAAGAAC 


GACCAGACGC 


TTGGTGTACG 


CGACGGCCAC 


TCATACGTTC 


AACAATCACT 


375600 


TCGTCTGGCA 


CATCAAATTC 


TAAAACAAAG 


TCAATTTTAA 


CACCTGAATC 


TTTCAGTGCA 


375660 


TCCGCTTGTG 


GAATAGTACG 


AGGGAAACCA 


TCTAACAAGA 


AACCATTTGT 


GCAGTCAGCT 


375720 
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TGAGCAATAC 
TCCATTAATG 
ATATCACCAG 
CCTTTACCTG 
AAAACGTGAT 
CACTTTTATA 
CAAAGCATGC 
GCTCCCACAA 
GTAAAAAGTG 
ACAAAAGCAG 
TTCCATAACT 
CCACCAATTG 
TTTAGCATTA 
TTTACAACAA 
AACGTTGTGG 
GCCTGAATGA 
AACGGCTGAT 
AAGATGCCTG 
GCCGCCAAAT 
TGAATAGTGT 
AACACGATAT 
CCCAGTTGTG 
TGTGAAAGCA 
AATGGTGCCC 
AG CTC AATTG 
AATGGCAAAC 
TGAGAAGAAA 
GCCTTTCAAT 



GATCTTTTAC 
CTTTAGCTTG 
TTGAAATTTG 
CACCCGGTGC 
TAAGATACTG 
CTGAGCTGGT 
CGGGAATCGC 
CATAACCAGA 
CAAGCTGTGT 
CGGTACCAAG 
CGGCAGATGT 
TTACCATTTG 
TCATACCACC 
TAGCAATATC 
CAAAAGAATC 
CGCCCTTACG 
TAGTTTGCTG 
CTAACATACA 
ATAGCGATAA 
TTC CTAAACC 
CTTGTGTAGC 
TGAGCGGATC 
ACATCCAGCT 
ACAAAAATTT 
AAAGATGTTT 
CAGAATTAAA 
GGGAATTTGA 
TAAAGGATTT 



AAGGGCAACG 
TTTGCCAAGT 
CGGGATACCA 
ACCTAAAAGA 
ATAAAGCATA 
CTTATTATTT 
TAAAAATAAA 
AAGTATACCT 
TGCAGTATAA 
TCCAACGCCG 
AATAACATCA 
TACAAGCCCA 
AGCAAGCCCA 
CTCTTTGCTA 
CCCGAACTTA 
CTGGAAAAAT 
CATGTCTATT 
AAGTGCGGTC 
TCCCCCTGGA 
TAATTCATGA 
TGAAAAAAAG 
AAACAGGCTA 
ACGACGACGC 
CAATCCATAA 
ATCTGTCAGC 
ACCTGTGAAG 
CATAAAGTGC 
TTGTAAAAAT 



GTTAATTCAT 
TCAGTCCCCG 
AATTTGTTCA 
ATAATTTTCA 
GTGCGGTCAA 
TCTTGCTTCA 
CAAAACCAAA 
AAAACTTTAC 
AGAGGATTAC 
ATATATTCTG 
AAATGTCCAA 
AATAGCCACA 
GATAAAATGC 
AATCCCATGT 
TACAAAAACA 
TCTTGTAATG 
TTTGGCTCTT 
CATAAAAAGA 
ATTAATCCCG 
TCACTCAAAA 
GCGATAAGCA 
ATGATGTAAA 
CCTAAAAAAC 
GGCAACATCA 
CAAACAGGCA 
GCGCAAATTA 
GGTTAATTTT 
TAGGGGATTT 



CTGGCACTAA 
CTTTGATTGC 
TAATAAATTG 
TAAAAAATCT 
ATTTTTTATA 
AAGGTGCTAC 
AATACTGATA 
TTGGCAAAGC 
TCTCACGCGC 
CTGCAATCAC 
AAGCAGCCAA 
ATGCACGATT 
TTGACCAAAG 
CATAAATAAA 
AAAATAGTAA 
GAATCCAAAA 
TGGCTAAAAA 
CGGTTTCCCA 
CAATTCGATA 
TTTCACGACG 
ACGCAATATT 
GTAAAATCAA 
TCGGGAAATA 
CACCCGTTAC 
ACATTTGCGA 
ACATCTTGCG 
AGTTGTATTT 
TTAATCATTC 
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TTTACCTTCA 3 7 5780 

AGCACGGAAC 3 7584 0 

TGCTTGAGTG 3 7 5900 

CCATTAATAC 3 75960 

ATTTTTACCG 3 76 020 

CCAAAATAAA 376080 

ATACCCCACT 376140 

TGAAAGGCTT 3 76200 

CATAAAAGCC 376260 

AACACCTAGT 3 76320 

CCAAATAAAC 376380 

AATACCCAGT 3 7644 0 

TGCGGTACTT 376500 

TTTGGTTTGT 3 7656 0 

AAAACCGATA 376620 

TGCTTGATAG 3 766 80 

TAGTGTCATG 3 76 740 

AGGATAAATC 376800 

AGCATTAATA 376B60 

ATACGCATCC 3 76 920 

TGCTACTGTA 3 76 98 0 

TAATGCCACT 3 7704 0 

ACGATCTAAT 3 77100 

TGCACCAATA 3 77160 

CAATACAAAT 3 77220 

CGTAAAAATT 377280 

T AGAAATTT C 3 7734 0 

CCCTAATGTT 3 774 00 



BNSDOCID: <WO 9633276A 1 J_> 



WO 96/33276 PCT/US96/05320 

TAG CATTCTT CTTCCTAAAA AAGAAAAACA AATATTAGAG AATATTAATC TCTATACCCT 3 77460 

TTTGGATTAT TTTTCTGCCA GTTCCACGTG TCTTGCATCA TTTTTTCAAG ACCACGTTCT 3 77520 

GCCACCCAAC CAAGCTCTTT TGCCGCTAAA CTAGGATCAG AATAGCAAGT GGCAATATCG 3 77580 

CCAGAGCGGC GTTCTACAAG TTTATATGCA ATTGTAATAT TGTTGGCTTT TTCAAAGGCT 3 7764 0 

TTTACCATAT CTAATACGGA ATAACCATGG CCCGTACCAA GGTTATAAAT ATGTAAGCCA 3 77700 

GCGTCATTTT CATGACGTTG AAGCGCTTTT AAATGCCCCA CAGCTAAATC CACTACATGA 3 77760 

ATATAATCAC GCACACCTGT TCCATCATGA GTGTCGTAAT CACTACCAAA TACAGAAAGT 3 77820 

TGTGCTAATT TACCGATAGC AACTTGGCTA ATATAAGGTA ATAAATTATT TGGAATACCA 3 77880 

TTTGGaTCTT CACCAATCAA GCCACTTTCA TGCGCCCCAA CTGGATTAAA ATAACGCAAG 3 77 940 

ATAGTCATGC TAAATTTTGG TTCCGCTTTT GCTGTATCGC GTAAAATCTG CTCAACCATA 3 78000 

TATTTAGATG TACCATAAGG GTTGGTTGTA CCGCCGACTT CACAATCCTC TGTAATTGGr 3 7 806 0 

ATAATTTTTG GnTCGCCGTA AACCGTTGCA GATGAGCTAA ATACAAAGTT CCAAACACCT 3 7812 0 

GCTTTTTTCA TTTCTTGAAT TAATACAAGG GTGCCAGCGA CATTGTTCAT GTAATATTCT 3 7818 0 

GTCGGCTTTT GAACACTTTC CCCTACGGCT TTTAACCCTG CAAAGTGAAT AACCGAGTTA 37824 0 

ATCTCATTTT CTGCAAAAAT TTTTTGTAAC AAAGCACGAT CTAAAATATC GCCTTCATAA 3 783 00 

AACTTTGCTT CTTTGCCTGT AATTTGTTTT ACGCGCTCAA GGGATTTTGG CGATGAATTG 3 7836 0 

CAAAGATTAT CTAATACCAC CACCTCTTTG CCAACATTTA ATAATTCTAC AACTGTGTGA 3 78420 

GAACCGATAT AACCAGCGCC ACCCGTCACT AAAATGGCCA TAATGATTTC CTTGTTAATT 3 784 80 

ATTGAGACTA AAAAATTATA TCACTATTTT GTGTGGTAAG AATGGTGATT TTCAATAAAA 3 78 54 0 

TAAAAAACCT GTTGCTAAAA AGTATTGTTG TAGCGAGTGA GATAACCTCT CATATTTAGG 3 7860 0 

GCTATGTCCA TTAGTTGTGT GATATAAGGT TATCTAATCC CATTTTCTTG ATTTTAAGGC 3 7 866 0 

GTGTTTTATA TCATTAGGTA AACGTAATAT ATCCCAGATC AACCGAAAAA TCAGATTTTG 3 7 8720 

GTAAATTCTT GGTTTGGAAT ATATACCAAT TTTTTTATAT ACAAACTTCG GCGGAATTAA 3 787 80 

AATATCTTGT GTGTAATTTT TCTTTTCTTC GAGAATAAAA GTAATTGGGT TATTCAGTGG 3 7884 0 

TGATAAAGGA AAATAATTAC ATAGAGGACT TTCTGGCGAA ATG CAAT AAA TATTTACTCC 3 78 90 0 

ATAATGTTTT TGTAAAAAAT AAAGTGCATT TAAATCATAT TCCATAGAAT GGATATCG CT 3 78 960 

TTGACTTTTA TTTTGTGAAA AAGAAGGTAA TAATTTAATA ATATTTTCTT TTTGATGATG 3 7 9020 

AAATGCGTAA GGATTCCCTT TTTCTTGATA AAAATCAATA CCTGTTAAAT AAAG AT CTTT 3 7 9080 
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ATAACCCATT ACAGTTGCCA CTGCACACAT ATAAACGCCT GATGTAATTC TCTTATTTTC 379140 
ATATAATTCG TTATATTGTA AATAGGCATC AAAGGCGGGT AGCTTGTTTA AATAATAATG 379200 
TCCAATATCA ACTTGTGTTA GTAATTTTTG TACATTTTTA ATTTTCTTTA ATTCTGAATC 379260 
TCCTAAATTG ACAAAAGATG ATAAAATAAT ATCAGCATAT TCATATTCAT TATTTTTAAT 379320 
TAATTGCATA AAAGTATAGT ATTGCtCAAA AATTGTAGAA CAATTAAAAA ATACtTTTTT 379380 
TATTTTCTTG CCAAGAAAAT AATGATCTTC AAAATAAAAT TGATTGCAAC GGAAAACATC 379440 
ATAATCTTTA GGTAATAAAC TATAGTCAAT TGATTTTAAA CTTGTTCCAT TACCTGCAAT 379500 
AATGACAGAC TTTGATTGAT TGATTGATTG ATTGATTGAT TGATTGATTG ATTGATTGAT 3 79560 
TGATTGATTG ATTGATTGAT TGATTGATTG ATTGATTGAT TGATTGATTG ATTGATTGAT 379620 
TGATTGATTG ATTGATTGAT TGACATATTG TACCGTTCAT TGAATAAAAT AGAAGTTATC 379680 
CCTnAAAAAT nATAAACATT ATATAGATTT ATAGAACTTT TTACTAAACG TTGATTATAC 379740 
CTTCCtAAAA nTATTATCTA ATATCAGTTG TACGATCTCC GTTGTATCTT AGCAAGAGAA 379800 
ATTTAAAAAT GCTTTTCTTG TTCCATTCGG ATAGAAACCT AATTGCTTGT TTTTTCTACC 379860 
AAACAAGCGT AAACTTACCA ACAGTTTTTA GTCGGGGTGC CGAAAGGCTG AGATGATACC 379920 
CGTGAACCTG AAACAGTTAG CACTGACGTA GGAAACTAAT ATGCCAAAAA ATCCATTTTC 379980 
TTTTCATTCC ATTTCTTATC CTTTCTTTTT TATTACGCAC TCCCTTTAAT CATTGAGGCG 3,80040 
AGCAATGGTG CGTATTCAAG ATTTGAGTCT TCCCTTTAAT GGGCAAACGT TATTTGAGCA 380100 
TCTTAATTTA ACCCTGTTGC CGAACGAATG GGTGAGCTTG TTGGGCAGTT CAGGCGTGGG 380160 
AAAATCGACA CTTTTGCGAC TACTGGCGGG CATTGAAACG CAAGGCATGG TGCAGGGGAA 380220 
AATCCTATTT GAACCGAAAG TTCGCATTGC GTGGTTGCCA CAAAAAGAGA CACTTTATCC 380280 
GTGGTTGTCA ATTGTCGATA ACGTGCAGTT GCAAGCGGTC TTATTTGGGC GAAAATCTGT 380340 
AAAAACCACT GAAAAAGCTA AAATGTTGCT TGAAAAAGTG GGAATGGCAG CACATTGGCA 380400 
CAAGCCTTGT TCGCAGCTTT CGGGCGGGCA AAAACAGCGG GTTGCCTTGG CACGCACCTT 380460 
AATGCAAGAG GTGGATTTGA TTTTGCTTGA TGAGCCTTTT TCCGCTCTTG ATGCCATCAG 380520 
TCGCCATCAA CTTCAAGATT TAGCCTTTGA ACTGCTCGAA GATAAATCAG TGCTGCTCGT 380580 
GACCCACGAT CCACAAGAGG CATTACGGCT AAGTCAGCGT ATTTTTGTGT TGCGTTCGCC 380640 
TGAAACTCAT CAAACAGCAT TATCGGCGGT GATTCTGCCC GAAGGTAACG CCCCGCGTGA 3 80700 
GTTGCATCAA GCCAATTTAT GGACGTTGCA GCAACAATTA TTGCAGGAAT TAGGGGGCGA 380760 
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ottacsgpjttt 


PTATTTGGGT 


TGATTTCGGC 


GTTGTTGCTC 


381000 
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AAAGAATATG CGGTAGAAAT TGAATAAGGA AAAAAGATGA TTGAACAACT CATTCAACAA 382500 

GCTCAGCCTT ATTGGCAACA ATATATTGAA CACGAATTTG TGCAACAGCT CGCAAAAGGT 382560 

ACGCTACCAA AAGCTTGTTT TCAGCATTAT TTGAAGCAGG ATTATCTCTA TCTTTTTCAT 382620 

TACAGCCGAG CCTTTGCCTT AGGTGTATTC AAGGCAAAAA ATTTCGCTGA AATGGAAACA 382680 

CCTCGTAAAA CCTTAGAGAT TCTTTGCCAA GAAATTCAAC TACATTTAAA TTATTG CCGT 382740 

GAGTGGGGAA TTAGTGAGCA AGAGATTTTT ACCACCCAAG AGAGTGCCGC TTGCATTGCT 3 82800 

TATACTCGCT ATTTGCTTGA TTGCGGCATG ACGGGCAGTT TGGCTGAGCT TTATGCAGCT 382860 

GTAACCCCTT GTGCTTTGGG TTACGCTCAA GTTGCTCGTT ACATTACTCA GCATTATCCT 382920 

CGCTTACCGa ACAATCCTTA TCAAACGTGG ATTGACACCT ATGCGAGTGA GGAATTTCAA 3 82980 

CAAGCCGCCC AAGAAACCGT TGATTTTCTT ACCGCACTTT GCAAACCACT AAACCCAAGC 383040 

CAGCTTGCTG AGATTCAACA GATTTTCACC ACCGCAACTA GAATGGAAAT TGCGTTTTGG 383100 

CAAATGGGGT TGGATTTGGC TTAGGCTTAG ATTGGCGATT AGACGAATAA CTTTATTTAT 3 83160 

CTCCAAACGG ATAAATTTAT TGCGATTTTC TACCTCCTCT TTGAGAAAAG AGGGGTAGAA 383220 

AAATTAT CTT ATTCTAATCT TGCTATAAGC TAATGCTATA ACAAAAAATG CGGCTTGCAG 383280 

AAGAATAATA CAAGCACCTG TTGAGGCATC AAAATGATAG CTCAATATTA CGCCAATTAA 38334 0 

GCTTGATGCA ATGGAACTTG CAATAGCAAC C CAT AACATT TTATCAAAAG ATTTAGTAAG 3834 00 

AGTAAGTGCG GTAATTCCTG GTGCAATTAA CATTGCCACC ACCAAAATAA CACCTACCAC 383460 

TTGCATCGTG CTTACAATGG TTAAAGCAAG TAGAATTAAT AAACCATAAT GTAAAATTTT 383520 

AGGAGAAAGT CCTGCAACAC GGGCGTGACT TGGATCAAAA CAATAAAGCA GAAAGTCTTT 383580 

ACGTTTAAAG ACAATCAGAC AAAAAATTAT CGCAGAAATG ACCGCACTTT GAATAAGTTC 3 83640 

TTGATGACTT ACACCCAACA CATTACCAAA CAAAATATGG GTTAAATGCT CTTCTGTTTG 383700 

AATTTTGGTA AACATAACAA GACCAATGGC AAACATTCCA GAAAATACAA TGCCCATTGC 38376 0 

GOTATCTTCT TTTATACGGC TATTTTCTTT CAAATATCCC ACACCAAGCG AACAAAAAAT 383820 

ACCCGAGAAA AATGCACCAA TGGCTAAAGG AATTCCTGCC AAATAAGCAA GTACAATACC 383880 

AGGTAATACA GCGTGAGAAA TTGCATCACC CATTAATGAC CAACCTTTCA AGACTAAATA 38394 0 

GCAAGAAAGT AACGCACAGA TAATCGAAAC AATCAATG CG GTCAATAAGG CATTTTGCAT 384000 

AAACTCAAAT TGTAGCGGTT CTAAAAGCCA ATCAAACATT TTCTACCGCC TTTTTATTAC 3 84060 

GAGTTAATAA GCCATATTTT GGTGAAAATA AGAAAGCGAC TAAAAACAAT AATGTTTGCA 384120 
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ATGTCACAAT CACACCGCCT GTGGCACCAT CTAAATAATA GCTAATATAA ACACCAACAA 384180 
AACTTGTCAC CGCCCCTAAA ATAATAGCGA TGATTGACAA GGTTTTGAAT TTATCTGTGA 384240 
GTAAATATGC TGTCGCACCT GGGGTAACGA CCATCGCAAT CACTAAAATC GCGCCTACAG 384300 
TTTGCAAGGC AGCAACCACA CAAGCACTCA ATAATGTGAA AAATAAAATT TTATAAAATA 384360 
ATGGAGAAAG ACCGACTGTA ATCGCTTGTG TTTCATCAAA AAAAATCAGC AGTAAATCCT 384420 
TCCAAAATAA TAAAAGCAAT ACTAAACAAA CACCGATGAT AATTGCCACT TGATAAATAT 384480 
CTTCATCAGC AATGCCTAAA ATATTACCTA AAATGATATT TTGCACATTT ACTGCGGTTG 384540 
GATTTAAAGA AACGATTAAC AAACCTAATG CAAAGAAAGT ACTGAAAATA AACCCGATAA 384600 
CGGCATCTTC TTTCAGTTTA GAAATAGATT TAATCCAAAG AATGGAAAGT GCGGCTAAAA 384660 
TTCCGGCAAA AAATGCACCA AGTGCATAAG GCAACGCAAA AG CAT AGG CA ATTGCAACAC 384 720 
CGGGTACGAC AGAATGGGAA AGTGCATCAC CAATTAGAGA CCAGCCTTTA AGCATTAAAT 384 780 
ATGAAGACAA AAAAGCACAA ATGCCACCCA CTGCGGTACT CAAAATCATC GCCTTGAGCA 384 840 
TATAATCATA GGAAAAAGGT TCAAGCAATA AATCTAACAT TACCTAATCC CTCTTATTTT 384900 
TAACTGCACT TTTATAAGGG CAATCTTCAA AATGACAATT TTGTGTTGTT GGCGCAGGTG 384960 
GATCTTGCTT AGTTTCGCCA TAGAATACAA CGGCTTTTTC ATCATCTGTT AAGACGGTAA 385020 
CAGAACGTTT ATCTTCATCA TTATGTAAAT TTTCGCCCAA TAATTTAATA TGACGCAACA 385080 
CCCCACCAAA TACAATTTCC AAATTATGTT GGTTAAAGGT GTCTTCTGTT TTTCCTGCGG 385140 
CGATGACTGT GCGATTAATC ATGACCACTT GATCACAAAA ATCAGGAACA GAACCTAAGT 385200 
TATGGGTTGA AACAAGAATC AGATGACCTT CTTCTCGAAG CTGTTGAAGA AGATCAACAA 385260 
TGGCATTTTC AGTTTTCACA TCAACGCCAG TGAAAGGTTC ATC CAATAAA ATAATTGGAC 385320 
TTTGTTGTGC TAAAGCGCGG GCCAAAAACA CGCGTTTTTT CTGCCCGCCA GATAACTCGC 385380 
CAATTTGACG ATGCGCAAGA TGCTCAATAT TTACTCGCTG CATCGCCTCT TGAACTTTTT 385440 
GTTTAT CAAT CGCTTTGGGG ATACGAAGAA AATTCATATA CCCATAACGT CCCATCATCA 385500 
CGACATCATA CACTGACACC GGAAATTGCC AGTCCACTTC TTCTGATTGT GGCACATAAG 385560 
CAACCAAATT ACGTTTCAGT GCTTGGGAAA TTGGCAAATC ACAAAGTTTA ATTTCGCCTT 385620 
GTTGTGGCTT TACAAGCCCC ATTATGCTTT TAAATAATGT GGACTTGCCA CTTCCATTCA 385680 
CGCCAACTAA CGCACAAATA GTTCCACTAT TGAGTGAGAA CGTCATATTG TGAATAGCAG 385740 
TATGACCATT GTTGTAACGA ACAGTTACAT CATTAACCCA AATTGATGTC GAGAAAGAGT 38S800 
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CCATTATTTT CCGAATCCTT TAACAATAGT TGATACAGTG ACGTTAAGCA AATCAATATA 385860 

AGTTGGCACA GGGCCGTTTT TCGCAGAAAG TGAATCAACG TACAATACGC CACCGTATTT 3 85920 

TGCACCACTT TCTTTAGCTA CTTGTTGTGC AGGTTTGGCT GAAATTGTAC TTTCACTAAA 38 5980 

CACAACTGGA ATATTGTTTT TACGAACTAA ATCAATCACT TTACGCACTT GTTGTGGTGT 386040 

GCCTTGTTGT TCAGCATTGA TTGGCCATAA ATAACCTTCT TTTAAGTTGT AATCTTTCGC 3 86100 

CAAATAACTG AAAGCACCTT CACTTGTTAC AAGCCAGCGT TGCGCTTCTG GAATTTGTGC 386160 

TAATTTTGCA CGAAGCGGTT CATCAAGTTG TTTAATTTTT TGAGCATAGT CAGCGGCATT 386220 

TTTTTCATAC ACAGCTGCAT TTTGTGGATC GTATTTCACT AACGCATTCT TAATATTTTC 386280 

AATATAAATT AAAGCATTAG ATGGCGACAT CCAAGCGTGT GGATTAGGCG CATCTTTATA 3 86340 

CGGGCCTTCA TAAATAGATA ATGGTTGAAT ACCTTCTGTT ACTACCACAG CAGGTTTATC 3 86400 

TTTAACATTT TGGAAGAAAC GCTCAAACCA ACGTTCTAAA TTTAAGCCGT T C CAT AAAAT 3 86460 

TAAATCAGCA GATTGTGCTT TTACAATGTC TTTTGGCGTT GGTTCATATT CGTGAATTTC 3 8 6520 

AGCACCTGGT TTGGTAATAG ATTC CACTGT TGCCGCATTG CCTGCCACAT TTTGCGCAAT 3 86580 

ATCTTGAATC ACGGTGAAGG TGGTAACAAC TTTGAATTTT GCATTAGCTT GCATTGCAAA 3 8664 0 

TAAACCAAGT GCAAGTGCGG TCATAATTTT GAATGAATTT CGCATAAGAT GTCCTTATTT 3 86700 

AAAAAAGATA GATTGAATGC CTGAATAATT GTTCAATAGC TCAACAATTA TTCAT CGGAG 3 8676 0 

ATAATAGTAC TACTTTTTAT ATTGTTTTGG AATGGGAATG ATTATCAAAA TAATAGATTT 3 86 82 0 

ATTTATAATT ATTTCGTACA AAATGTATTG AATGTATGAA AAAAGAACAA AACATTTTGG 3 8688 0 

TTAGTATTAA GTTCAGCAAT AGTCTTGGCT GCTTGTGATG ATAAACCTAA AATGACAGAA 3 86 94 0 

CAAACCAAAA CAACTGACCA AGAAAAAGTC ACTGTAGAAC AGCCTGCTCA TCAGCTAAAA 3 87000 

AATGCGAGAT AAAACAAACT GTTATCTCGC ATTTTTATGT TTATGAACAT TATTTTGATA 3 8706 0 

ATTTCTGAAT TAGCGTATTA AATCCCACTT CATCATAGCC ATCTTCATAT TGGGCTTCTC 387120 

CTGCTACATC ACGAATTGTT CTTCCATTTA ATGCCGTGCA GCCGAATTTT TCTAATTTGC 3 8 7180 

TTTTAGTAAA ACTAGAACGA TACGCTGAGG AAGTGTTATT TAAACTAATT ACGCCAATAG 3 87240 

TTGCATTTTT GCATCGGTGT TTATTAATGA AAACGAGATT ATAACCAGAT TCTTGAATGT 3 87300 

AGC CTGTTTT ATTCACTGCC GCATCAAAAA TTTCATCACG CACTAACTTA TTGGTATTTT 3 87360 

TAATATAAAG TTTTTGTTTC CCTGCTTGGA TATAAGTTGC ACTAAGATTA OATAAACGTT 3 8 7420 

TAATGTCCGA TTTATTTAAT GAATATTTGG CGAGCTTAAC TAAATCCATT GGGCTGGAAA 3 8 7480 
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TATTATAGCT AGATAATCCT GAACTATCGT GAAAAC GAGT AGAATACATA CCAAGTTGAT 3 87540 
GGGCTTTTTC QTTCATTTTC TTAATAAATT GGCGACGACT TATTCCCGCT GCACGAGAAA 387600 
GCGCGTGGGC GGCATAGTTA TCAGAATGGA CAAGCATTGC TTTCAATAAT TCGTTACAAG 3 87660 
AAATAGGAAT ATTTTTAGGT AATTTTGTTC CAGTGCCTTT AATTCGATCC GTATCCTCTT 3 87720 
TTGTGATTGC GATTCGACAA TTAGGATTTT TATTATTCTC TAAAAATACA TTAGCCGTCA 387780 
TTAATTTTGT TACAGAGGCA AT AGG CTGAA TACTATCAGA CGCGTGGCTT TCCAACACTC 3 87840 
GATTATGGGT AAAATCATAC ACAACATAAG ACTGTGCCAA CGCAAAAGAA GGACACAAAG 387900 
ATAAAATAAA TAAAGCTTTT TTAAACATTA TTGGTCACCT TAAAAAATAG G CTGAT ATTT 3 8 7960 
TACCGCACTT TTTCCG CTGT AATTGAAAAT ATTTGCGATC TAGATCGCAA AAAAGTGCGG 3 88020 
TAAGATTTTC TATCGTTTTT CAATTCCGCT AGATTATGCG GTGACAAATC TGTACAATCG 388080 
CCGTTCATTT TTTAATTGCG AATTTTTCGT TGTGAGTTTT TGATAAAAAC ACCTGCACAA 3 8814 0 
TGCTATCGCC ACAATGTGTA ATAACGAGGC TAAAATGTCA GAATTATTGT CTGTGCAATC 388200 
AGATGCACCC GCTAAAAAAA TTAATCTAAT GGATTTAACG CGCCAACAAA TGCGCGAATT 3 88260 
TTTTAAAGAA CTCGGCGAAA AACCATTTCG AGCAGATCAG TTAGTTAAAT GGATTTATCA 388320 
TTTTGGCGAA GAT AATTT CG ACAATATGAC AAACATTAAT AAAAAATTAC GTGAAAAACT 388380 
TAAGGCGGTT GCAGAAATTA AAGCCCCAGA AGTTGCCGTT GAGCAGC GTT CAGCCGATGG 388440 
CACGATAAAA TGGGCAATGC AAGTGGG CGA GCAGCAAGTT GAAACTGTGT ATATTCCCGA 383500 
AGCAGATCGC GCGACGCTTT GTGTATCTTC CCAAGTTGGC TGTGCCTTAG CTTGTACCTT 388560 
CTGTTCAACC GCACAACAAG GCTTTAACCG CAATTTAACG GTTTCAGAAA TTATCGGTCA 388620 
AGTATGGCGA GCCTCTAAAA TTATTGGCAA CTTTGGCGTA ACTGGCGTTA GACCCATTAC 388680 
AAATGTGGTA ATGATGGGAA TGGGCGAACC CTTATTGAAT GTTGCTAATG TTGTTCCTGC 388740 
AATGGAAATT ATGTTAGATG ATTTTGCCTA CGGTTTATCA AAACGTCGTG TAACCTTATC 388800 
GACTTCTGGT GTTGTGCCAG CATTAGATAA TTTGAGCAAA ATGATCGATG TGGCTTTGGC 388860 
TATTTCATTA CATGCCCCAA ACGATGAATT ACGTGATGAA ATTGTGCCAA TCAACAAAAA 388920 
ATACAATATC AAAACATTGA TTGACTCGGT AAATCGTTAT TTAAATGTAT CAAATGCAAA 388980 
TCATGGCAAA GTGACGATTG AATATGTGAT GCTCGATCAT GTAAATGATG GCGTAGAACA 389040 
CGCTCATCAA CTTGCAGAAG TATTAAAAAA TACGCCTTGT AAAATTAATT TAATTCCGTG 389100 
GAATCCGTTC CCAGAAGCAC CTTATGCAAA AAGCTCGAAT ACGCGTATCG ATCGCTTCCA 389160 
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AAAAACCTTA ATGGAATATG ATTTTACCGT GATTATCCGT AAAACCCGTG GAGATGATAT 3 8 9220 

TGACGCAGCC TGTGGGCAAT TAGCGGGAGA TGTTATCGAT CGTACTAAAC GTACGGCAAT 3 89280 

GAAACGCCAA TTTGGACAAA ATATTGGGGT GACAGAAGTT AATTAATATT CTAGGTATGT 3 8 934 0 

GAATTAATCA AGTTCACATA CCCCTATGTT TT CAT AC AAA TTAAAGCATA TTAAGTATTT 389400 

CTTCCTGCCC AATTAAAaGG AGGAaaAATG AAAACAATAA GTAAACAACT AAGTGCGGTT 3 8 9460 

ATTTTCCCTT TCATTTTTTC TGCCTGCGTT TCACAATCTG CATCTAGCTT AAATCATCAA 3 8 9520 

ACTGCCGCTA AAGCGCGTGT GGAACTCGCT TTGAGCTATC TTCAGCAAAA TAATCCTCAA 38 958 0 

CTGGCTAAAA TCAATTTAGA CAAAGCACTT CAACACGATA AAAATTACTA TCTCGTGCAT 3 89640 

TCAGCACTTG CACATTATTA TCAACAACAA GGGCAAATAG AAAATGCCTT TCGTGAATAT 3 8 9700 

GAAATAGCCG TAAACCTTAA TCATAAACAA GGCGATGTGC ATAATAATTT TGGTACGTTT 3 8 976 0 

CTATGTAGTC AAAAGAAATT TGAACAAGCT CAGCAACAAT TTGAATTAGC ACTTAATTCG 3 8 982 0 

CCGAATTATT ATCATCAAGC AGATACATTT GAAAATATCG TGCTTTGTGC TTATTCCGCG 3 8 988 0 

CAAAAAATGG ATATTTATCA GCAAACATTA GAAAAATTAC GTCAAATCGA TGGAAAACGT 3 8 994 0 

GCGGAAAAAT TCAATAGCCT AAAATAAAGC ATTGCCAAAT CAAGTATTGG ACTTTAAAAT 3 90000 

TTAGCCTCCA TTTACCGCCT AATTATTTGA TTAAATTTAT GAATCCAACT ACTG AG CAAG 3 9006 0 

TGCTTTCCCC TGGTGAGATT TTTCGTCAAA CACGCGAAGC CTTAAATCTT T C TCTTGAAG 3 9012 0 

ATGTCGCTAA AGAAATCACG CTACGCCCTT CTATATTAGA GCAACTTGAA AATAATGAAT 390180 

TTATTCAAAA ATCTACCCCT GCAATCTTTG TAAAAGGTTA TGTGCGTAGC TATG CAAAGT 3 90240 

TCTTACGTTT GCCAGATTCT GTGTGGGAAA ATATTGTTTT TGCTGAAACA GAAAAAAATG 3 90300 

ATTTAGGGAA AAATGCACGT TCAACCCGTG CCGTTAATCA ATATTCCTCT CACAATCGTT 3 903 60 

GGATCGGGCG TTTAACCGCA ATAGTATTCA TGATTGTAAT TGGTATGACT GGGCTTTGGT 3 9042 0 

GGTGGCAGAG TTATCAACAA AATACTCAAG AACGCGATGA TTTAGTGCAA AGCTATGTTG 3 90480 

CCTCTACCGA AAACAATCAG CCCGCCACCG CTCTTGTGAC AACAGAAGAA AGTAATAAAA 3 9054 0 

CTGTACCAGA AACAGCAGCA CCAGTCTCAC AACCAGTAGA AATAACAAAT AATCTGTTAC 3 90600 

CAGAAATAGC ACAAGAAAAT AGCGTTTCAC AACCCAAAAA CGATGAAAAA TCAGTTTCAG 3 90660 

AAATTCAAAG TGCGGTCGAA AATCCAAGTA TTTCTCCAAC ATTACCTATA GCAAAAGGCG 3 9072 0 

ATTTAGTGAT TGAAATACTA ACAAATAGCA GTTGGATTAG TGTAAAAGAT AACGCTCGCC 3 9078 0 

ACG T ATT AG C ACAAAAAGAA TATAAACAAG GCGAAATATT AACCTTTAAT GGCAATGAAT 3 9084 0 
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TTTCATTAAT TGTAGGCGCG CCAAGCAATG TGCGCATCAC TTACAAAGGC GAGAATTATC 390.00 
CTCTGAAAGT GGATGGTCGT GTAGCAAAAT TCAAACTTTC ACAACCTTAG GGAAAAATAA 3 9096 0 
TGTCAGCTTT TCAGCCAACT ATTAAGCGTC GTGAATCGAC AAAAATTTAT GTGGGAAATG 3 91020 
TACCAATTGG TGGGGATGCG C CTATTGC CG TGCAATCAAT GACAAATACT CGCACCACTG 3.1080 
ATGTGGAAGC GACAGTTGCT CAAATTAAAT CATTAGAACG TGTTGGTGCA GATATTGTTC 391X40 
GTGTATCTGT TCCAACAATG GATGCTGCGG AAGCATTTAA ACAAATTAAA CAACAAGTGA 3 9 1 200 
ATGTTCCGCT CGTAGCAGAT ATTCATTTCG ACTATCGTAT CGCGTTAAAA GTCGCAGAAT 391260 
ATGGAGTGGA TTGTTTACGT ATCAATCCTG GCAACATTGG TCGTGAAGAT CGCGTCCGTG 3 91320 
CCGTTGTTGA TTGTGCGCGA GACAAAAATA TTCCGATTCG TATTGGTGTA AATGCAGGCT 3.1380 
CTTTAGAAAA AGATTTGCAA GAAAAATATG G CGAACCAAC GCCAGAAGCC TTGTTAGAAT 331440 
CCGCATTGCG TCATGTAGAA ATTCTAGATC GTCTTAACTT CGATCAGTTT AAAGTGAGCG 3 9 l 500 
TAAAAGCCTC CGATGTATTC TTAGCGGTTG AATCTTATCG TTTACTGGCT AAAGCAATTA 39 1 S60 
AACAGCCTTT ACATTTAGGC ATTACAGAAG CAGGTGGCGC ACGCGCTGGT GCAGTAAAAT 39 1 620 
CTGCAGTGGG TTTAGGAATG TTATTAGCTG AGGGCATTGG CGATACACTA CGCGTCTCTT 391680 
TGGCGGCAGA TCCTGTAGAG GAAATCAAAG TCGGTTTTGA TATTTTGAAA TCTTTACGGA 3.1740 
TTCGTTCAAG AGGAATTAAC TTTATTG CTT GCCCAACCTG TTCTCGCCAA GAATTTGATG 3 91800 
TAATCGGTAC AGTAAATGCG CTAGAACAAC GCCTTGAAGA TATTATTACA CCAATGGATG 3 91860 
TATCTATTAT CGGTTGTGTA GTGAATGGTC CTGGCGAGGC ACTCGTCTCC GATCTCGGCG 3 91 920 
TAACGGGCGG TAACAAAAAA AGCGGTTATT ATCTTGACGG AGAACGCCAA AAAGAGCGTT 391980 
TTGATAACGA AGATATAGTG AACCAATTAG AAGCAAAAAT TCGTGCGAAA GTCGCACGAC 392040 
AAGATCCAAA AAACAGAATT ATTTAAGGAA AGCTCGTGGC AAAAACAATT CAAGCAATTC 392100 
GTGGTATGAA CGATTGTGCC CCAACAGAAT CTCCATTATG GCAATGGATT GAAGCACAAG 3 92160 
TACGCAATGT ATTAAATAGT TATGGCTATT CGGAAGTGCG TATGCCAATT GTAGAAAGTA 392220 
CGCCATTGTT TGCGCGTGCA ATCGGCGAAG TAACTGATGT TGTCTCAAAA GAAATGTACA 3 92280 
CATTTTGGGA TAACGATGAA CAATTAACCT TACGCCCTGA AGGCACAGCA GGTTGTGTGC 392340 
GAGCAGCCAT TGAACACGGT TGGATCTATA ATAATGAACA ACGCTTATGG TATATCGGGC 392400 
CAATG TTCCG TCATGAACGT CCACAAAAAG GGCGTTACCG TCAATTCCAT CAAGTAGGTG 3 92460 
TTGAGGTATT CGGTATTGCA AATCCAGAAA TCGATGCCGA ATTAATCATG CTTACCTATC 392520 
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GTTTATGGAA AGCATTAGGT ATCGATCAAC ATGTTACACT TCAACTTAAC TCAATTGGTT 3 92580 
CTTTAGAAGC TCGAGCAAAT TATCGTTCAG CTTTAGTTGG ATTCTTAGAA AACCATCAAG 392640 
ATTTAATGAG CGATGAAGAA AAAGAGCGTT TAGTAAGAAA TCCGCTTCGT ATTTTGGATA 3 92700 
CAAAAAATCC AGAATTACAA AAAGTATTAG ATAACGCGCC TAAATTGCTT GATTATTTAG 392760 
ATGATGAAAG CCGTACGCAT TTTGAACAAT TATGTTCATT ATTAGATGCA GTGGGTATTC 392820 
AATATGAAAT TAATCCTAAA CTCGTTCGTG GCTTAGATTA TTACAACAAA ACTGTATTTG 3 92880 
AATGGGTAAC CTCTGCTCTT GGCGCACAAG GTACCGTGTG TGGCGGCGGT CGTTATGACG 3 92940 
GTTTAGTGGA ACAACTTGGC GG AC AC G C AA CGCCAAGTAT CGGATTTGCG ATGGGATTAG 393000 
AGCGTTTAGT CTTGTTGGTG CAAGAAGTTA ATCCAAATGT TCCTGCGAAA AGTGCGGTGG 3 93060 
ATATTTATGT TGTTTATCAA GGCGAAGGCG CAACATTAGC CGCTTTTGAA TTAGCGGAAA 3 93120 
AAGTACGCTC GGAATTACCG CACTTAAATA CCATGTTGCA CTGCTCTAGC GGTAATTTTA 3 9318 0 
AAAAACAATT TAAACGTGCA GATAAATCTG GTGCAACATT GGCATTAGTT ATCGGCGAAA 3 93240 
GCGAAGTACA AAATAAACAA GTCGTTGTTA AACACTTACA AGGTGGCACA GATCAGCAAA 3 93300 
CCTTAGATTT AGTGAATATA ATCGACTATC TTCAAACTCA ATTTTAATTT AAAAGGGGAA 3 93360 
AACATGGCTT ATTCCATTGA AGAAGAACAA GAGATTAACC AGTTAAAAGA CTGGTGGAAA 3 93420 
GAGAATGGCA AAACAATTAT TGTCG CTTTT ATCTTAGGTG TGGGTGGTAT GTTTGGATGG 3 9348 0 
CGTTATTGGC AAACTCACCA AGCAGAACAA ATCGCTCAAG CTTCTGCACA ATACGATACT 3 9354 0 
TTAATTAATT CTGTGCAACA AGACGAACAG GCTAAAAAAG CAAATATAGA ACAATTTGTG 3 93600 
CAAGCGAATA GCAAAACTGC ATACGCTGTA TTTGCTTTGT TAGATGAAGC GAAAAAAGCC 3 93660 
ACTGAAAAAC AAGATTTTTC AGCAGCAGAA GCAAATTTAA ACCAAGCATT AACCCAATCT 3 93 720 
CAAGATGAAG TATTAACATC TATTGTTGCA TTACGTCTAA GTGCGGTGCA ATTCCAATTA 3 93780 
GG C CAATT AG AT AATG CTTT AAGCACATTA AACCAAGTGA AAGGCGAAAG TTTTAATGCT 3 9384 0 
CGCAAAGCGA TTTTAACGGG CGATATCCAA GTTGCAAAAG GAGATAAAGT TGCGGCGAAA 3 93 900 
AACAGCTTTG AGCAAGCACA ACAAAGTGGT AGCCAATTAG AACAACAAAT GGCAAAAATG 3 93 960 
AAATTAAACA ATCTTTAAAA AATAATACCC G CAT CAAAGT GCGGGTATTT TTTCAATCAA 3 9402 0 
ATTTCTGATT ATTCAAATTC ATCTAACCAT TTTAATAAAA TAGCCTCAAG AATTGATTCA 3 94080 
TTTGATTTAT TCGGATCATC GTCAAAATTC TCAAGTTCAC AAATCCATTT ATGTAAATCA 3 94140 
GTAAAACGAA CTGTTTTTGG ATCTAAATCT GGATTACGAT CGTAAAGTTC TTCTG CAATT 3 94200 

-77.236- 

SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO 9633276*1 I 



WO 96/33276 PCI7US96/05320 

AATTGGGCGT CTGTCCATTT CATTAGTGAG CCGCCTCATT TGCGTGATTA AGGTTATATT 3 942 6 V 
TTGGAATTTC TACCACAAGA TCTTCATTTC CCACGACGCA TTGACAAGAT AAACGGCTAT 394320 
CCATTTCTAA TCCCCAAGCC TTATCTAGCA TATCTTCTTC TTGATCACTC GTTTCATTTA 39 4380 
AAGAATCAAA CCCCTCACGA ACAATCACGT GGCAAGTTGT ACAAGCACAA GAACCATCAC 3 94440 
AAGCGTGATG AATTTCTACT CCCGCATTGT GAGCAACTTC TAATAAGTTA TCTCCTGTTG 394500 
CAGCATCAAC GACCATACCT TCAGGACAAA AATCTTCATT AGGTAAAAAA ATCACTTTTG 394 560 
GCATAATACT TTCCTCAAAA TTATTTAATA TCTGACTGCA GTTTTACGGA TTTTCAATGT 394620 
CTGACAGATT TTTACCTGTC AAAGCTTTAT TAATTGATGC ATTCATTCGG CGAGCAGCAA 394660 
ATTCTTGGGT AGCAGTGTCT AAGGCTTTAA TTCCTTGCGC AATTGCATCG CGATCTGAAC 394740 
CTTGTTTTAC ATCCATTAAT TGTTTTAACA CAGTTTCAAT ATGATGAAAT TCATCTGTGC 394800 
TAAGCAATTC AGCCCCATCT GCTTGTAGCG CAACAATCAC GCTTTCAATA ACACGATCTG 394860 
CTTCCACACG CTGCTCTGCC AATTCACGTG CTTGTAAATC TTCTTGTGCA TTATCAAAAG 394920 
AAGATTTAAT CATTGCGGTT ACTTCTTCAT CAGTTAATCC ATAAGATGGT TTAATTTGTA 394980 
TTGAAGCCTG TACTTTTGTC GATTTTTCCA TCGCTGTAAC GCTTAATAAA CCATCTGCAT 395040 
CCACTTGATA AGTCACTCTA ATATGGGCTG CCCCCGCCGC CATTGGTGGA ATACCTCGTA 395100 
GAGTAAAGCG TCCTAAAGAA CGACAATCAT CGACTAGCTC TCGTTCACCT TGTAGCACAT 395160 
GAACAGTCAT TGCCGTTTGC CCATCTTTAA AAGTAGTAAA CTCTTGCGCA CGAGCCACAG 395220 
GAATTGTTGT ATTACGTGGA ATAATTTTTT CCACTAATCC GCCCATCGTC TCAATGCCTA 395280 
AGGAAAGCGG GACAACATCA AGTAATAACA TATCAGAATC AGTTTTATTC CCGACTAAAA 3 9534 0 
TATCTGCTTG AATTGCAGCA CCAAGTGCCA CAACTTTATC AGGATCGATT GAGGTTAATG 39S400 
GTGTTTTACC AAAAAACTCT CCAACTTGTT CTCTTACATA AGGCACGCGA GTTGAACCAC 395460 
CAACCATTAC TACGGCTTGT ACTTCTTCGG ATTCTACATT GGCATCTTTC AATGCCCGAC 395520 
GACAAGTTAG CAATGAACGT TTTACAAGCG GATAAATTAA TTCATTGAAT TGTTCCCGCG 395580 
AAATCTCTAC TGAAAAATCT TGCCAAGAAA TAACCGCACT TTTTTCATTA GTCAAGGTAA 395640 
TTTTAGCTTG GTTGGCAAGG GTTATTAGCT CACGCTGTTG ATTTGCTGTT TGTGGTTTTA 3 95700 
GCTTAGTTTG TTCTATTACC CAATCTGCAA TTAAATGATC AAAATCATCT CCACCTAATG 3 95760 
CAGTATCTCC CCCCGTTGCT AAAACTTCAA AGATACCTTT TGATAAGCGT AAAATTGAAA 3 95820 
TATCAAATGT GCCACCGCCT AAATCATAAA CCGCAATAAT GCCTTCTTGC CCACTATCCA 39S880 
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ATCCATAAGC AAGTGCAGCT GCAGTAGGTT CATTCAATAA ACGTAATACA TTTAG CCCAG 3 95940 
CTAAACGCGC AGCATCTTTA GTACTTTGTC GTTGAGCATC ATCAAAATAA GCTGGAACCG 396000 
TAATCACAAC CCCAGAAAGC TCACCTCCAA GGCGTTGTTC TGCAATGTGA TTTAAGCGCG 396060 
ATAAAATATC TGAAGACACT TCAATTGGAC TTTTAGCCCC CTGTGCCGTA ATAATTAAAG 396120 
GTAATCCATT TTCACTCGCC ACAAATTCAT AAGGCAAACT AGAATAACGA GACTGAACAT 396180 
CAGAAAGGCT GCGACCAATT AAGCGTTTTA CTGAAATCAC CGTATTTTTG GGATCAAGTG 396240 
AGGCTTGTTC AAAGGCTTCT AAACCAACTT TCTTCTCTTC AACGCCATAA TGAACAACTG 396300 
ACGGCACAAG ACTGCGTTCT TGTTCATCAT TAAGAATCAC GGATTGCCCA CTACGCACGG 396360 
ATGCAACCAA AGAATTGGTT GTTCCCAAAT CAATCCCAAC GGCTAAACGA TGTTGATGCG 396420 
GTGCCGCAGC TTGTCCTGGT TCTGCAATTT GTAATAATGC CATATGCTAT TTTTCTACTA 396480 
GTTATGCGAC GGAATACGCC AAATCAAATT TATTTTTTTC TGCAAAGTTT GCTTGGTAAT 396540 
GGGTTTCAAT ATTGACTTCC TGTTTTTGAC GTTCATCAAT TGCTATTTTT CTGTCTTGGT 396600 
TTAAATCAGG CGTAAATACA AAAACTTGGT TACCATCTAA TTGTTTTACA TCATCTTGCC 396660 
AATTTTTCCA AAATAGCCCT TGATAAAAGG CCTCTACATC GCCATTCAAT ACCCAAGCTA 396720 
AAAATTCTGT GTAAGTAAAA TTTAAATTAT GCCAAGTAAG ATCTTTTGAT GAATAGTAAT 396780 
AAATTTTTCC AATATTATTG CCTAACGAAC CACCATTTAA CGCAAAATAA CCACCAATCA 3 9684 0 
CATCATCTGC AACTAATAAA TATTTTGGTT TTTCGCCTGA CTCACTAAAA GATTTACTGA 396900 
AATTCCAATC CATCAAG CCA CGCGGTAATT TAAAACTTC C AGAACCTAAA ATACGTAACC 396960 
AACCATAATG AATTAAAATT CCACCTGTTT CATAGATTAC TGCACCCATT GGCGAGGATG 397020 
TTGGCATTTG CATAGTAAAA AGCTCACGCT CCGCACTGGA TTGATCTTTT TTAATTACTT 397080 
CACAATGATT GCGAGCGTTA TCAATCCATT GGGAAAGGGT TATCCAAGCA GAATGTTCTG 3 9714 0 
GG CTAG TG AG CTG TT CAAG A GTTTGCATAG TGTTATTTAT TCACATCTCG TAGGGGCAAA 397200 
TTACATTTGC CCCCACATTT AATCTAATTC AAAAATTCGT TCTTCCACAC GTTCAATTTC 397260 
TTCAGAAAGT TTATGGGTAA AACGTAATTT ATCACAGTAT TGAGATGCTC TCGCCCATTG 397320 
TTGAGATTTC AAACTCTCAA AAAGTGCGGT CAATAATGAT TGAGTTTCTT GTTTAATTTC 397380 
TTTAGCAAAA GCATTTAATG CTCTTTCATC TTGCTGGCGT TCAACTTCTT CTAATTGTTC 397440 
GCGCCATTGT AATTGTTGCA TTAAAAACGC CACATCTTGC GTGCTTTTTT GTTCAAGATC 397500 
TAACTGCTCA CCAGTATTAA GAG CAATGAT CGCTTCAGCT CGCAAAATTG GGTCTTTTAA 3 97560 

-77.238- 

SUBSTITUTE SHEET (RULE 26) 



BNSOOCID: <WO_963327aA 1 J^> 



WO 96/33276 



PCT/US96/05320 



CGTTTTTAAG GCATCATTCA CTTCGGTAGA TTTTTGCATC GCAACACGTT GATCTAAAGC 3 9 7620 

ATTACTTGAG ACAAAATTAT CTGGATGTAG GGCCTTTTGT AATTTTAAAT AACGTGCATT 3 97680 

TAGTACTTTT TCATCCAGCT GAAAATCAAC TGGCAAATCA AAAATTTGAA AAGGATTGAA 3 9 7740 

CATCAGATTA TT AC CTT AAA CGTGGAAACT TTCGCCACAA CCACAAGATT CTTTTACATT 3 97800 

TGGATTATTA TATTTAAAGC CTTCATTTAG GCCTTCTTTT ACATAATCCA GCTCAATGCC 3 9786 0 

ATTGAGATAA ACGAGACTTT TTGGATCGAC AATAATATTA ACACCGTACT GCTCAAAAAC 3 9792 0 

TTGATCTTCG GAATTTAAAA CATCTACAAA CTCAAGTACA TAAG CTAAGC CAGAACAACC 3 9798 0 

TGATGTTTTC ACACCTAAAC GTAATCCAAT ACCTTTACCA CGATTATCCA AAAAGGCTTT 3 9804 0 

CACTCGCTGC GCTGCTTTTT CTGTTAATGT AATACCCATC TATCCCCCAA CACCGCTAAA 3 98100 

AATGCGGTCA AAATTGACCG CACTTTTTCA CATGATTAGC CTTGTTTTGC TTTATAATCA 3 9816 0 

GCAATTGCTG CTTTAATTGC ATCTTCTGCT AAAATAGAGC AATGCACTTT CACTGGCGGT 3 9B22 0 

AATTCTAATT CTTCAGCAAT TTGGCTATTT TTAATTGCTC CAGCTTCTTC TAAAGATTTA 3 98280 

CCTTTTACCC ACTCAGTAAT TAATGAACTA GATGCAATCG CAGAACCACA GCCGTAAGTT 3 98340 

TTGAATTTTG CATCTTCAAT AATGCCATTA TCATCTACTT TGATTTGTAG TTGCATAACA 3 98400 

TCACCACAAG CTGGCGCACC AACCATTCCA GTACCAACAT TGCTATCTTT TTTATCCAAT 3 9846 0 

GATCCCACAT TACGTGGATT TTCATAATGA TCAATTACTT TTTCGCTATA AGCCATTTTA 3 9852 0 

ACTTTCCTTT TTTCAATTCT AGTGAATGCC ATTGT CAGAT AATGGCAACT AAATATTAGT 3 98 580 

GAGCTGACCA CTCAATGGTA TTTAAATCAA TACCTTCTTT AAACATATC C CATAATGGCG 3 9864 0 

ATAATG CACG AaGTTTTTCT ACCGCACCCT TCATTAAATT AATTGTGTAA TCAATTTCTT 3 98 700 

CTTCAGTAGT ATAACGACCT AATGTGAAAC GAATTGAACT GTGCGCCAAT TCATCATTTA 398760 

AGCCAAGTGC G CGCAATAC A TAAGATGGCT CTAAGCTAGC AGATGTACAA GCAGAACCAG 3 98820 

AAGAGACAG C AATATCGCGC AACGCCATCA TTAAGGATTC CCCCTCAACA TAGTTAAAAC 3 988 80 

TAATATTTAA ATTGCTATCt AAACGATGTT CCATTGAACC ATTTACATAA GTTTCTTCAA 3 98 94 0 

TATCTTTTAA GCCATTATAT AAACGATCAC GAAGTGCTTT TAAACGCGGC ATTTCTGATG 3 9900 0 

CCATTTCTTC TTTTGCAATT CGATATGCCT CTCCCATTCC TACGATTTGA TGAACAGGTA 3 9906 0 

ATGTAC CTGA ACGCATACCA CGTTCGTGAC CACCACCATG AATAATCGCC TCTAAACGTA 3 9912 0 

CACGAGGTTT ACGACGAACA TACAATGCGC CAACACCTTT TGGCCCGTAA AGTTTATGGC 3 99180 

TAGACATCGA CATTAAATCA ACAGCTAATT CTTCCAAATT AATTTCAACT TTACCAACAC 3 99240 
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TTTGAGTTGC ATCCACATGG AAAATTGTTT TATTTGCACG ACAAAGTTCG CCAATAGCTT 399300 
TAATATCTTG TAACACACCA ATTTCATTAT TAGCGTGCAT AATTGAAGCT AAAATTGTAT 399360 
CTGGACGAAG CGCAGCTTTG AATTTTTCTA AATCAATTAA ACCATCAGCT TCTGGAGACA 399420 
AATAAGTGAC TTCAAAACCT TCTCGTTCTA ATTGACGGCA AGTATCTAAC ACTGCTTTAT 399480 
GTTCAGTTTT ACAGGTAATA ATGTGTTTAC CTTTAGTTTG ATAGAAATGC GCCGCACCTT 39 9540 
TAATAGCGAG GTTATCGGAT TCTGTCGCAC CAGAAGTAAA GACAATTTCA CGAGAATCTG 399600 
CGCCAATGAG ATCAGCGATT TGATTACGCG CAATATCAAC AGCTTCTTCT GCTTGCCAAC 399660 
CAAATTTATG TGAGCGAGAG GCTGGATTAC CAAATGTGCC ATCGATGGTT AGAAACGCCA 39 9720 
TCATTTTTTT CGCAACACGT TCATCGACTG GACAGGTCGC TGCGTAGTCT AAATAAATAG 399780 
GTAATTTCAT TTTCACTCCT AAATTATGCT AAATATAACC GCACTTTTTT ATTAAAAAAT 399840 
ATACTG CTTA TCTTTTTTAT TGATTAACTA ATAAATTATT AAAATCACGA TGAGATTGAC 399900 
GTTTTACATT ACG CTTATTT ACCAGCTCTG CTAACGTAAT CTCATTTAAA AAACTTTCAA 399960 
TACGTAAACT TAAATCTTCC CAAAGTTCGT GAGTTAAACA TTCCACACCA TTTTTACAAT 400020 
TTTCACGCCC CAGACATTTC GTTACGTGAA TATTTTCATT TACTGCCGCA ATAATCATTC 400080 
CTACAGAAAT TTGTTCACTT GGTAGCCCTA ACTGATAACC ACCTCCTGGC CCGCGAACAC 400140 
TTTTGACTAA ACCGTCTTTA CGTAATTTAG CAAAAAGTTG CTCTAAATAA GATAAGGAAA 400200 
TATGTTGGCG TTCAGAAATA TCCGCAAGGC TTACAGGCCC GCCATCTGCA TTTAAAGCAA 400260 
TATCTAAAAC GGCTGTAACT GCATAACGAC CTTTCGATGT AAGTTTCATA AAGAATCCCT 400320 
AAAAATTAAT TGCGCAATAC TTACATATCT GACTAATTTA GTCAATTATC CTCTTTTGTT 400380 
AAATCAATTC GTTTTTCAAC CGCACTTAGC ATGCCATTTA AAATATTTAA TTCGTTTTTC 4 0044 0 
TCTAATTTTG CACGATAGTA TAAACGCTTT AATTTACGCA TAACTCCTTG ATTTTGAATA 400S00 
AATCCAAGAG ACTGATATAT ACGTTCTGTG TAGTCAAAAA AATAAGCTAA CTGATCTGTT 400560 
GTTGGATAGT TTTTCTCTAT TAAAGAAAGT GAATTTTTCT TATTATTTTG AACTAAGAAA 400620 
GCCAkACGCA ATTCATAACT TACCAGCTGC ACAGCCATTG CTAAATTTAA AGAAGAATAA 400680 
TCAGGATTCG CTGGAATATT CAAGTGATAA TGACATTTCA GCAGTTCTTC ATTGGTTAAT 400740 
CCAATACGCT CACGACCAAA CACAATAGCT ATCTTGCCTT TATACGCTAC AACTTTCTCC 400800 
GCGCATTCTC GAGGCTCGAT CAATGTATTT TGTAAATGAC GTAAGCGAG C ACTTGTTCCA 400860 
ATTACCAAAG GGCAATCATC GACGGCTTCA TCAAAACTAT CGACAACCCT AGCATTTTTT 400920 
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ACAATATTTT CTGCGCCAGC TGAAAGGGCA TAAGATTGTT CATCAACAGA TTTAGGTGAA 4 00980 

ACAAGACAAA GCTGGGTCAA TCCCATCGTT TTCATTGCTC TAGCCGCAGA ACCAATATTA 401040 

CCACTGTGCG AAGTTTCAAT TAAAACAATA CGAATATTTT CCAACATAAT GATAC CCGAC 401100 

TAAAATTTAA GGGTATTCTA CCACAATAAC CGCTAAAAAT AGTCAAGAAT TTAAGCTAAT 4 01160 

TAAAGAAATT TAAAATATTA GGGATGAATA ATAAAAATGG TGGGTTGTGA GAGACTAGAA 4 0122 0 

CTCTCGACTG ACGGATTAAG AGTCCGCTAC TCTACCAACT GAG CTAACAA CCCAACTGGA 4 01280 

CAACTAAATC AGAGAGTGGT GGGTCGTGAA GGATTCGAAC CTTCGACCAA CGGATTAAAA 4 0134 0 

GTCCGCTGCT CTACCGACTG AGCTAACGAC CCCACTGTAT TAAATGGATT TATGATAACG 4014 00 

CATTTGCTCT AGAACCGAAA GAATAAATTC ATTCTTTCAA GCTAATTGGT TAAATTACAA 4 01460 

ACGATGAAAA CAAAACAGGA nCCAATAAAA AATGGATCCT GTTTTTCAAG TATTAGAAAT 4 01520 

ACTAAGAATT AGTACGCTAA CACTGCACGA CGGTTTTTAG AATATGCAGC TT CAT CATGA 4 0158 0 

CCTAATACTG CAGGTTTTTC TTCACCGTAA GATACTGTGC CTAATTTACC AGCATCAACA 4 01640 

CCTTTACCAG CTAAATAACC TTTAACTGCA TCTGCACGAC GTTGGCCTAA TGCGATGTTG 4 017 0 0 

TATTCTGGTG TACCACGTTC ATCAGTGTTA CCTTCTACTA ATACTTTAGC AGCTGGCGTT 401760 

GCATTTAAAT ATGCAGCGTG CGCGTCTAAG ATTTGAACGT ATTCACCAGT AATGTCATAT 4 0182 0 

TTATCAAAAC CGAAATAAAC GGTATTGTAA CGTTGTTGAA GATCAGCAAC AGAGTAACCG 4 01880 

CCAAAAGTTT GAGCAGCACC ATTGCCTGCA GCATCGTTGT TAGATGAACT ACAAGCTGCT 4 0194 0 

AATGCAGCTA CAGAACCTGC AACTAATAAT GATTTAACAA ATTTGTT CAT TAGATTTCTC 4 02000 

CTAAATGAGT TTTTTATTTA GTTAAGTATG GAGACCAAGC TGGAAATTTA ACTTGACCAT 4 02060 

CACTTCCTGG AAGGCTCGCC TTAAAGCGAC CATCTGCGGA AACCAATTGT AGCACCTTTC 4 02120 

CTAAGCCCTG TGTAGAACTA TAAATAATCA TAATTCCATT TGGAGAGAGG CTTGGGCTTT 4 0218 0 

CGCCTAGAAA AGATGTACTA AGTACCTCTG AAACGCCCGT TGTGAGATCT TGTTTAACTA 4 02240 

CATTATTGTT AC CATT AATC ATCACAAGTG TTTTTCCATC TGCACTAATT TGTGCGCTAC 4 023 00 

CGCGACCACC CACTGCTGTT GCACTACCAC CGCTTGCATC CATTCGATAA ACTTGTGGCG 4 02 36 0 

AACCACTTCT ATCGGATGTA AATAAAATTG AATTTCCGTC TGGCGAC CAC GCTGGTTCAG 4 0242 0 

TATTATTACC CGCACCACTC GTCAATTGAG TAGGTGTACC GCCATTTGCT CCCATAACGT 4 02480 

AAATATTCAG AACAC CAT CA CGAGAAGAAG CAAAAGCTAA ACGAGAACCA TCTGGCGAAA 4 0254 0 

AGGCTGGTGC GCCATTATGC CCTTGAAAAG ATGCCACTAC TTTACGTGCG CCAGAATTTA 4 02600 
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AATCCTGTAC AACAAGTTGT GATTTTTTAT TTTCAAACGA TACATAAGCC AAACGCTGGC 4 02660 

CGTCTGGAGA CCAAGCTGGA GACATAATTG GTTGGGCACT ACGATTGACG ATAAATTGAT 4 02720 

TATAGCCATC ATAATCTGCT ACACGAACTT CATAAGGTTG CGAACCGCCA TTTTTTTGCA 4 02 780 

CAACATAAGC GATACGAGTT CTAAAGGCAC CACGGATCGC AGTTAATTTT TCAAAAACTT 4 0284 0 

CATCGCTCAC AGTATGCGCG CCATAGCGTA ACCATTTATT TGTTACTGTA TAGCTATTTT 4 02900 

GCATTAATAC AGTCCCTGGC GTACCTGATG CACCAACCGT ATCAATTAAT TGATAAGTAA 4 02 960 

T ACT AT AAC C ATTACCCGAT GGAACCACTT GCC CAATTAC AATTGCGTCA ATTCCAATAT 4 03 020 

TCGACCAAGC CTCAGGATTT ACCTCTG CAG CTGAAGTTGG GCGTTGAGGC ATTTGAGAAA 4 03 080 

CCGCAATAGG ATTAAACTTA C CACTGTT AC GTAAATCATC TGCAACAATT TTACTAATAT 4 03140 

CTTCTGGTGC AG AAC CAACA AATGGCACGA CAGCAATAGG ACGCGCACCA TCAACCCCTT 4 032 00 

CATCAATGAC AATGCGTACT TCATCGCCAG CGAATGCATT GCTTCCAACA GCAAGTACAA 4 03260 

TCGCGAATAC GCTCACTAAA CGTTTTAATA ATTTCATTTT GTTACCTTTA AAATTTAACA 4 03 32 0 

ATAAATTTTT CTAAAGAATT ATCGAATATC AAAGTCAATA ATTGGTGATT TATATTTTTC 4 0338 0 

ATAAATTTCA TCTGATGGCG CAGCTGGAAC TT TTTT CGTT CTAGCCACCG CACTTAATGC 40344 0'. 

AG CTGAACAA ATATCATCAG AGCCTGAAAT TTTTTGATAC CCCAAGATTG TGCCATCTCG 4 03 500 

AC CTAATTGA ATTTTAATAC GACAAACCTT TCCTGCAAAA TTTGGATCTT TTAAGAAACG 403 56 0 

ACGTTGAATC TCTTTCTTAA TTACACCTGC GTATTGATCC CCAACCTTAC CACCATCGCC 4 03620 

AGAGCCAAGT GCAGCACCGC TACCTTGAGT TCCACCTTTA TTTGTGTTTC CCCCTTTAGA 4 036 8 0 

TGCACTACCG CCACCAATAT CTCCGCCATT TAAGAAATCA TCTAGGCTTG CTTGATCTGC 4 03 74 0 

TTTACGTTTC GCTTCCGTAG CAGCTTTAGC TTCTGCATCA GCTTTTGCTT TTGCCTCTGC 4 03 8 00 

TGCCGCTTTC GCTTTTGCCT CCGCTTCAGC CTTTGCTTTA GCTTCAGCAA CGGCTTTTGC 40386 0 

CTTAGCTTCG GCTTCTAGTT TCGCCTTAGC TTCCGCCTCT TGTTTTGCTT TTTGAGCAGC 4 03 92 0 

AATTTCTGCT GCTTTCGCTT TAGCCTCTTC TTCAGCTTGT TTTGCCGCGG CAGCTAAACG 4 03 980 

TTTAGCCTCT GCATCTGCTT TTAATTTTGC AGCTTCAGCC GCTTGTTTAG CCTTAGCCTC 4 04 04 0 

TTCAGCTTGC TTCTGTTTTT CCAACGCTTC TTGACGAGCT TGCTCTTGTT GTTTTTTTAT 4 04100 

TTCTTGCTGA CGTTGCTGTT CTTGCTGACG T T T TAACTCT TCTTGTCGTT GAACTTCTTG 40416 0 

TTGATGCTTA ATCTCTTCTT GATTAGGCTC AGGTGGTTTT TCTTCCACAA CAGGTTCTGG 4 0422 0 

GCGTTTTTGT TTATCCGCTT GCCCTTTTTT TTGTTGTTGA ATACGCCCCC ATTCCTGAGC 404280 
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AGCCGTACCA GTATCAACAA TCACTGCCCC TATTACATCT CCTTCACCTT CTCCACCACC 404340 
CATAATTTCA ACAGTGTGAT AAAGTGAGCT TAAAATCAAT AAGCCAAACA AGATAAAGTG 404400 
CAAAAGGATA GAAATAGCAA AAGCATTGAT TCCTTTCTTT TGTCGATTAT TTTGCACGTG 404460 
TTACCTACTT AGCTAAATGG GATTTGTCAT TAAACCTACA GATTTAATGC CTGCAAGATG 4 04520 
AAGTAAATTC AATGCCTTAA TCACTTCTTC ATAAGGTACT TCTTTAGCTC CGCCTACTAA 404580 
AAATAGCGTA TTATTATCCT TATCAAATTC CTGTCTAGAT AATTGAGTAA CCATTTCTTC 404640 
TGTTAAACCT TCTTGACGTT CTCCGCCAAT AGAAATCGCA TATTTTCCAA TGCCTGCCAC 404700 
TTCAAGAATG ACGGGTACTT TATCTTCATT AGAAACCTCT TGGCTTTGCA CAGAATCAGG 4 04760 
CAATTCAACT TGAACGCTTT GACTAATAAT AGGGGCGGTT GCCATAAAAA TTAACACTAA 4 04820 
AACTAAAAGC ACATCTAAAA AAGGCACAAT ATTAATTTCA GATTTAATTG CTTTACGCTG 404880 
ACGACGAGCC ATATATTCCT CTAAAATTTT AACTTATTTT TACCGCACTT TTTCTTCAAA 4 04 94 0 
GTGCGGTCAA TTTTCCCTAT ATTTTAGTGA GGGGCTTTAC CAAAGGCTTG ACGGTGTAAA 4 05000 
ATCGTCGTAA ATTCATCAAT AAAATTACCG TAATCTTGTT CAATGGCATT CACTCGTAAG 4 05060 
CTTAAACGGT TATAAGCCAT TACTGCAGGA ATTGCGGCAA ATAAACCAAT CGCAGTGGCA 4 05120 
ATCAAGGCCT CAGCGATACC TGGCGCTACC ATCTGTAACG TTGCTTGTTT TGCACCACTT 4 05180 
AATGCCATAA AAGCGTGCAT GATACCCCAA ACAGTGCCGA ATAAACCAAT ATAAGGGCTA 4 05240 
ACAGATGCCA CTGTGGCTAA AAATGGAACT CGGTTTTCCA AACTTTCAAT CTCACGGTTC 4 0 5300 
ATCGCAAGAT TCATCGCGCG CATTGTGCCT TTAATAATCG CTTCAGGTGC ATCTGGATTT 4 05360 
ACTTGTTTTA AACGTGAAAA TTCTTTAAAT CCCACGCAAA AAATTTGTTC GCTGCCCGTT 4 05420 
AATCCATCGC GACGATTAGA TAGCCCTTCA TAAAGTTTAT TTAAATCTTC TCCTGACCAG 4 05480 
AAACGATCTT CAAACGTACG CGCTTCTTTT AAGGCATTCG TTAAAATACG ACTACGTTGA 4 0554 0 
ATGATAATTG CCCAAGATAT GATTGAGAAA GAAATCAAAA TCACAATTAC CAGTTGCACA 405600 
ACAATACTTG CTTTTAGAAA AAGATCTAAA AAATTCAATT CTGCAGTCAT TGCATACTCC 4 05660 
GAAAAATTAT TTTAAGTGAT GAAACGCCGC TTTAACTTCT TTGGGAAACG CCACTGGTTT 4 05720 
CATCTTGCCT AGATCAACAC AGGCTACCTT AACAGTAGCC TTTGATAACA TCAGGGTGTT 405780 
GCGCATCAGT CTCTGTTCAA AAAGGATTGT AGCCCCTTTT ACTTCTGAAA CCTCTGTTTC 4 05840 
CACCATAAGT AAATCATCCA ATTTTGCTGC CACGCAATAA TCAATGGCGA GCGTTTTGAC 4 05900 
AACAAATGCG AGTTGTTGTT CCTCTAGTAA GGTTTGTTGC GTAAAATTTA ATGTACGCAA 405960 
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ATATTCTGTT CTTGCTCGTT CAAAAAAATG CAAATAGCGA GCGTGATACA CTACGCCACC 406020 
TGCATCAGTA TCTTCATAAT ACACACGAAC AGGAAAAGAA AAGCCATTAT CCAACATATT 406080 
CTCACCCAAT TGGTCGCAAT AAACCGTGTA TTCTAGAACC AGTTTTTGGG ATAAGCAAGC 406140 
TATCTATGAA AAACTCAATA AGATTTTATT CATTTTAAAA CATCTAAAAT TTTTACCGCA 40o200 
CTTTTAGCCT GACTAGCAAA AGATAAGGTA ATGACAAATC ATTTTTAACC TTTCTCATTG 406260 
AGTAAAATCT ATTCAAAACA TAACCGTTCT TTAAAAATAG CCTCTATGTA ATCTTAAGCC 406320 
ACCAGTATTT TTATTCTTGA TATTTAGCGT TTCTATGCGA CAATCTTTGC GGTTATTTAC 406380 
TTTAAAAATA TGTTTTACTA GATGGATTAC GAAAATCAAA TTGCCAATAT TTTCTCACTA 4 06440 
AATGGCGAAT TAAGCCAAAA TATCAAAGGT TTTCGTCCTC GAGCTGAACA ACTTGAAATG 406500 
GCATATGCTG TAGGTAAAGC AATTCAAAAT AAATCTTCCC TTGTTATTGA AGCTGGAACG 406560 
GGTACAGGAA AAACCTTTGC ATATCTCGCA CCTGCTTTAG TTTTTGGTAA AAAAACCATT 406620 
ATTTCTACAG GATCTAAAAA TCTTCAAGAT CAGCTTTTTA ATCGAGATCT TCCAGCTATT 406680 
AAAAAAGCCC TTAATTTCAC GGGAAAAATT GCGTTGCTAA AAGGTCGGGC AAACTATCTT 406740 
TGTTTAGAGC GTTTAGATCA AGTGATTGCT CAAGGTGTGC TTGGAGATAA GTCTGTTTTA 406800 
GCTGAATTAA GTAAAGTACG AAAATGGAAT AATAGTACTA AAACAGGCGA TTTCACAGAA 406860 
TGTATCGAGC TTGCAGAAGA TAGCCCAATT ATTCCTCAAC TGACGAGTAC AGCGGAAAGT 406920 
TGTTTAGGCA CCGATTGCCC TAATTATTCA GAATGTTATG TAGCAAGTGC GCGTAAAAAA 406980 
GCATTAAATG CGGATTTGGT TGTAGTTAAT CATCATTTAT TTTTTGCTGA TATGGCTGTG 407040 
AAAGAAAGTG GTTTTGGCGA ACTTATTCCA AATGCAGAAG TGATTATTTT TGACGAAGCT 407100 
CAT CAG TT AC CCGATATTGC CAGTCAATAT TTTGGTCAAT CTTTAACATC ACGCCAGCTT 407160 
TTTGATTTGT GTAAGGATAT TAATATTGTT TACCGCACAG AACTTAAAGA TATGCAGCAG 407220 
CTTGGCACAA CATCTGATAC CTTACTGAAA GTGGTACAAG ATTTTCGTTT ATTACTGGGT 407280 
AATGGGAGTA ATGTTCGAGG AAATTGGCGT GAGCTTTACA CGCAAAGTGC GGTGAAAAAA 407340 
GCCTTTGAAT TATTACAAGA GAAAATAGAT TTTTTATCTG AAGTAATTAA ATTAGCACTT 407400 
GGTCGTTCCC AAACTTTAGA CAGTATTTTT GAGCGTGTTG AGTCCATTAA AATCCAACTC 407460 
AAACGTTTAT CTGAAACAAA TATTGTGGGT TATTGTTATT GGTATGAAGG AAATGGTCGC 407520 
CAGTTTGGTT TACATATTAC GCCATTGACA GTAGCTGATA AATTTGGTGC TCAGCTCGAA 407580 
GCGAAAGAAG CCGCTTGGAT ATTCACATCT GCGACATTAG AAGTGGGGGG AACTTTTAAT 407640 
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CATTTCTGCC AACGTTTAGG CATTGAAAAT GCGACACAAA AAATTTTATA TAGTCCCTTT 4 07700 
AATT AT C CTG AACAATCTTT GCTTTGTGTG CCTCGATATT TGCCGAATAC CAATCAAATG 407760 
AAT AC ATT AA ATTCACTAGG CGAAATATTA TTACCCGTGA TTGAAGCAAA CAAAGGGCGT 407820 
TGTTTTGTTC TTTGCACTTC CTATTCAATG ATGCGTGGTT TAGCTGAATA TTTCCGTGAA 407880 
AAAAGTCATC TTTCGATTTT ATTACAAGGC GAAACCTCAA AAGGCAAATT ACTCGAACAA 407940 
TTTATAAAAG AAACGCATAG CGTTTTGGTG GCAACTTCTA GCTTTTGGGA AGGTGTAGAT 408000 
GTGCGAGGCG ATGCACTTTC TTTAGTGATT ATTGATAAAC TTCCTTTTAC TGCGCCAGAT 403060 
GAGCCTTTGT TGAAAGCTCG CATTGAGGAT TGTCGTTTAC AAGGTGGCGA TCCATTCAAT 408120 
GATATTCAAA TTCCTGAAGC GGTAATTACC TTGAAACAAG GCGTTGGACG TTTAATTCGT 408180 
GATGTTACAG ATCGTGGCGT TGTGATTATT TGTGATAATC GATTAGTGAT GCGCAATTAT 408240 
GGCGAAACTT TTTTGAAAAG TTTACCTAAC TCAAGTCGTA C C CGTGATCT CAACAAAGTG 4 083 00 
ATACAATTCT TACAAAATAA GTAAACGAGA ATAAGAATGC AAAATTTAAC TTTATTGGCG 408360 
TTGGACACCT CTACTGAAGC CTGTTCTGTC GCTTTATTGT ATCGTGGTGA GAAAACACAT 408420 
ATT AAT G AAC TTGCCCAACG CACACACACT AAACGAATTT TACCTATGAT CGATGAAATT 408480 
TTGGCAAATT CGGGTTTAGG TTTAAATCAA GTTGATGCTT TAGCTTTTGG GCGTGGGCCT 408540 
GGTAGTTTTA CTGGTGTTCG TGTTGGTGCT GGGATTGCAC AAGGTTTAGC GTTTGGTGCG 408600 
GATTTGCCTG TCATTCCAAT TTCAAATTTA ACCGCAATGG CTCAGGCAGC ATTTGAATTA 408660 
CATCAAGCAG AAAATGTTGT TGCCGCGATT GATGCGAGAA TGAATGAAGT CTATTTTTCT 408720 
CAAGTAGTGA GAGAAAAAGT GCGGTCAGAT TTTGGGGAAT TTTTTCAATG GCGAGAAATC 4 087 80 
ATTAGCGAAC AAGTTTGTTC TCCAGAACAA GCGATTAACC AGCTTCAAAA TGATAACGCA 408840 
TTTAGAGTCG GAACAGGCTG GGCTGCTTAT TCTCAATTTA CTGAAAAAAA TCTAACTGGC 4 08900 
TCAGATATAG CATTGCCTAA TGCCTTATAT ATGTTAGAAC TTGCGCAGGT AGAATACTTG 408960 
CAAAAACACA CAATTTCAGC TTTAGAAATT GAACCGATTT ATTTGCGCAA CGAAGTTACT 409020 
TGGAAAAAAT TACCAGGGCG TGAATAAGTT TAAATAACTG GAGGATAGAA AATGAAAGGA 409080 
AAAATCACCT TATTTTTTAC CGCACTTTGT TTCGGATTAA CGGGCTGTAT TGCACCACCA 409140 
AAAGGGTTAG AAAAAGAGCG ATTCTCAATT AATTCCTATC GCGAGATTTC CCCTCAGGAT 409200 
TTGACCTGTC ATTGTAAAAC AGTTCGACTT GGAGGAAAAA TTGTCAATAC TACCGTTTTA 40926 0 
GCAAATCAAA CAAAAATTGA AGTGTTAAGT TTACCCGTAT CAT C AATTT C AGCTAAACCA 409320 
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TTTGTTGAAT TGCAATCCGA TGGTCGCTTT ATCGTGTATT TCAACGGTTT TGTTGAGCCT 4 0 938 0 

GAAAATTTAA AAGAACGTTA TATTACTGTA GGTGGTCAAT TAGCTGGAAC AGAGAAAGGC 409440 

AAAATAGAAC AAGCTGATTA TACTTATCCT GTTGTTCAAG CGGATAAATA CCGTATTTGG 4 09500 

ACACTCAGTA CCACCTATGA TTATCCAACA GATGATTGGG ATGAAGACGA TGATTGGGGA 4 0956 0 

TTTTTTAGAT GGAGACATCG CCTTTGGTAT GTTCAGCCTG AAATTCAC T A TTATTTGAAT 409620 

TAAATGATAA AATCTTAAAG ATTTTCCAAC AAATTAGGAT AAAAAATGGA AAAAATTTGG 4 09680 

TTTCAAAATT AT C CAAAAGG TTCTGAAAAA TTTCTAGATA CCTCAAAATA TGAATCTATT 4 0974 0 

TTGGATATGT TCGACAAAGC AGTGCGCGAA CATCCAGATC GCCCCGCTTA CATTAATATG 4 0 9800 

GGGCAAG TAC TTACTTTCCG TAAATTAGAA GAGCGCAGTC GTGCATTTGC CGCATATTTG 40 9860 

CAAAATGAAT TTAAACTTCA ACGTGGAGAT CGCGTGGCAT TGATGATGCC AAATTTATTG 4 0 992 0 

CAATATCCTA TCGCACTTTT TGGTATTTTG CGTGCTGGCT TAATTGCAGT AAATGTCAAT 4 0998 0 

CCTCTTTACA CACCGAGAGA ATTAGAGCTC CAGTTGCAAG ATAGTGGTGC GGTAG CGATT 41004 0 

GTCGTGGTAT CAAATTTTGC TTCTACCTTG GAAAAAGTCG TTTTCAATAC CAATGTTAAA 410100 

CACGTTATTC TTACCCGAAT GGGCGATCAA CTTTCTTTTG GTAAACGTAC TTTAGTCAAT 41016 0 

TTTGTCGTAA AATATGTGAA AAAACTGGTG CCGAAATATA AATTACCGCA CGCAGTGACG 410220 

TTCCGAGAAG TATTGAGTAT TGGTAAATAT CGCCAATATG TTCGTC CAG A AATATCGCGT 410280 

GAGGATTTAG CCTTTTTACA ATACACTGGC GGTACAACTG GCGTGGCGAA AGGTGCAATG 41034 0 

CTAACGCACG GAAATATTAT AACCAATGTC TTCCAAGCAA AATGGATTGC AGAGCCATTT 410400 

ATTGGCGATC ATTCTCGCAC TCGTTCAGCT ATTTTAG CCT TGCCTCTTTA TCACGTATTT 410460 

GCCTTAACGG TAAACTGCTT ATTATTTTTA GAGTTAGGTG TCACGGCGAT CTTAATCACC 410520 

AATCCGAGAG ATATTGAAGG ATTTGTGAAA GAATTGAAAA AATATCGTTT TGAAGCAATC 410580 

ACTGGTGTGA ATACCTTATT TAATGCGTTG TTAAATAATG AAAATTTCAA AGAAGTTGAT 410640 

TTTTCTGCGT TGAAACTTTC TGTGGGTGGC GGTATGGCTA TTCAACAATC AGTGGCAACA 410700 

CGTTGGCACG AATTGACGGG CTGTAACATC ATTGAAGGTT ATGGAATGAC GGAATGTTCG 410 760 

CCATTAATCG CCGCTTGTCC GATTAATGTT GTGAAACATA ATGGCACTAT TGGTGTGCCA 410820 

GTGCCAAATA CAGATATAAA AATCATTAAA GATGATGGCT CTGATGCAAA AATTGGCGAA 4108 8 0 

GCTGGCGAAT TATGGGTGAA AGGCGATCAA GTAATGCGAG GTTATTGGCA ACGACCAGAA 41094 0 

GCAACCAGTG AAGTGTTAAA AGATGGTTGG ATGGCAACTG GCGATATTGT CATTATGGAT 411000 
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GAATCATATA GCTTACG CAT TGTAGATCGT AAAAAAGATA TAATTTTGGT TTCTGGCTTT 411060 

AATGTCTATC CAAATGAAAT TGAAGATGTC GTTATGCTGA ATTACAAAGT GTCTGAGGCT 411120 

GTGGCAATTG GTGTTCCTCA CGCAGTTTCT GGCGAAACGA TAAAAATCTT TGTGGTGAAA 411180 

AAGGACGATA GTTTAACTCG TGATGAACTT CGCAACCATT GTCGTCAATA TCTCACAGGC 41124 0 

TATAAAGTGC CAAAAGAAAT TGAGTTCCGA GACGAACTAC CAAAAACTAA TGTTGGCAAG 411300 

ATTTTACGTC GGGTTTTACG CGATGAAGAG ATCGCAAAAC GTCCAAAACA TTAATGTCAA 411360 

AATGGGATAG CAATATCCCA TTTTTGTTCT TTTCAAATAA AAAAATCACT ATGATAAAAG 411420 

AATGTCAAAA TCCACCGCAC TTTAGGGTCG TTACGGATAA TACTGCGCTA TTAGAAGTTT 411480 

GTAATTTAGC CCAACAGAAA AGTGCGGTCG C CTT AG AT AC AGAATTTATG CGTGTTTCCA 41154 0 

CTTATTTTCC CAAGTTGGGA TTAATCCAGC TTTATGATGG AGAGCACGTC TCTTTAATTG 411600 

ATCCTTTAGC GATTACGGAT TTTTCGCCTT TTGTTGCATT GTTAGCCAAT CCTAAGGTAT 41166 0 

TAAAAATCTT ACATTCTTGT AGTGAAGATT TATTGGTTTT CTTACAAGAA TTTGATCAGC 41172 0 

TTCCACGCCC AATGATTGAT ACGCAAATTA TGGCACGATT TCTTGGGCTT GGCACATCAG 4117 80 

CAGGATTGGC AAAATTAGCA CAACAATATT TAAATGTAGA AATCGATAAA GGCGCAACTC 411840 

GCACCAATTG GATCAAACGC CCCCTTTCTG ATATTCAACT TCAATATGCC GCAGGCGATG 411900 

TTTGGTATTT GTTGCCACTT TATCATATTC TTGAAAAAGA G TT AG CG AAA ACACCTTGGG 411960 

AACAAGCAGT AAGAGATGAT TGCGAACTTG TTCTTGCTAA AACG CAT AAA TTACAAGAGC 412 020 

GAGATTCAGA AAAAGCGTAT TTAGATATTC CAAATGCTTG GAAATTAAAT CCATTAGAAT 41208 0 

TGTCTCGTTT AAGAGTTTTG GCACAGTGGC GACAAAACGT GGCAATAGAA CGTGATTTAG 41214 0 

CACTTTCTTA TATTGTGAAA TCAGAACATC TTTGGAAAGT GGCAAAGaAT AATCCACGTA 412200 

ATACATCTGA AATGTTAGAA ATGGGATTAA CTGAAAATGA AGTGCGTGTA CGTGGTAAGG 412260 

AAATTCTACA ATTACTTTCT CAAGCAAGAC GTATTTCGTC TAATGACTAT CCAAAATCTA 412320 

TTGAACGTAT TTCTGAAGAT CCTCGTTATA AAAAAACAAT TCGATTATTA CAAGAAAAAG 4123 8 0 

TGAAT AG CTT AACACCAGAA GGTTTAACCC CTGAAATCGT TGCAAGTAAA CGCACTTTGG 41244 0 

AAGAATTGAT TAAATGGGTT TGGAAATATG ATTGTTCTCA AGATAAACGA CCAGAGTTGC 412 500 

TGATTGGCTG GCGAAAACCT ATTGGTGAAA AGTTAGTCGA TGCACTTAAG TAGATATAAA 412 560 

AAAGCGTTGA GTAAATAGCT CAACGCTTTT CGTTTCTACT CGTAGAGTTT TTTCACTTGT 412620 

TCAGGTGTCA TCACTCGTTG GTATTTACTA AAtTCTTTTG C CATATAGT A AGCACCAAAA 41268 0 
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TTTGTTAAGT GATCTTGATC CCCATACAAG TATTTTCCTT CTGCCATAAC GCTATCTTTT 412 74 0 

GGTAAATACT GTTGAGCATC TACCCAATAA ACATTAGGAA TATCTTTGAC TAAATCATGA 412800 

ATAATTTTGT TACTCGCATC AATATCGCCC ATTCTACGAA TTGGTTCTAA ATATTTTTCA 41286 0 

AGG C CAT ATT TTCCCAGTAA ATATCCACGA ATAGGCGATC TGCTAATAGA ACTGTTATTT 412 920 

G CAAAGAT AT AAACTGATTT TGATTTAGCC AGTAATTTTA CAGTTTCAAC AAAGCGAGTT 412 98 0 

TTGAAG TT AT CAATAAAGGA AGCCTGAGCG TTAAATCGGG GAACAGGTTG TCCTCCCATT 41304 0 

CTTAAATCAT AAAATC CAGA AATAAAAATA G CTTTAT ATT TAGGTGAATT AATCTCATCA 413100 

AGTAACCCTT GACACTCAGG TGTTATTTTA TTGTTTTTAT CCACTAAAAC AAAACAAT CC 413160 

TTCCCATCAG TTATGCTTAA AATAGTTGAC TTCCATCCTT CTTGCTTACC AAGATATTTT 413220 

AAAAAATAAG TAAGATGACC AGCATGAGAA TCCCCTATTG TTAAAACTGT AGTTGGGAAG 413280 

TTATTCTCTA AATTAACAGA TAAAACATCT TGTTCAATAT GTTCTTTTTC TTTTCTAATC 41334 0 

TGTTTTCTTT CATAAGTGTT ATAACCAATA ATAAAAAGAG ATGGAATTAG ATAAAGATAG 4134 00 

ATAAATGCCT GTTTAAAGTT TAATTTTGAT TTTCTTATAG GTTGCTCAAT TAAATAATAA 413460 

CTTGTGATAG AAAAAATAAT AGTTAATATA ACAACAATTA AAACTGTAAT GCCTTGAATC 413 520 

TGCTTGCTGC CAGTAATGTA ATAAGCAAAC GCAATAAAAA TCCAATGATA TAAATAAnGA 413 58 0 

GAATAAGGAn ATTTTnCCAA TAAACACTAT TGCTTTATTA GGATAAnCAT TATTTAACGG 413 64 0 

ATGTTATTTT GTGATGTGGT ATGAATAATT AATGCTGTAA AAATACAAGG AAGAATTAAA 413 7 00 

GTAATACCTG GAATAAATGC AATATTATTA TTCATTAAAA AT AAG CAGG A AAATAATAAT 413 76 0 

AATGTACTTA AAATAGCCAG AATATTATTT ACTTG CTT AC TCAATTGAnC CTTGTTTGAA 413 820 

AGATTATGAT AAATTGCCAA CAAAGATCCT ACAAGTAATT CAGGAAATCT T AGG TT AG AG 4138 80 

AGATAATAAA TATTAGGCTG GTGCAATACT TCTTTGTAAA AGTTAGCTGA AACAAAAGAT 413 94 0 

GTCGCAAGTA AAATAAAAAA CAAAATTAGC GTTATAATAA ATAATACTTT AACTTCTCTA 414 000 

AATTTTTTAT AAGCTAAAAT GAGGATTAGT GGAAAAATAA GATAGTATTG TCCTTCTACG 414 060 

GCTAATGACC AAATATGTAA TACAGGGTTT TCATTTGCAC TTAAATCAAA ATAAC CTTGT 414120 

GTTAATCCTA GATAGAAATT AGATAGGAAT GCAATAGCTA ATTCAATGGT TTTCCTTAAT 41418 0 

TTATTGAAAT CATTATAAAT AAAAnTAGCA GACGCAATGA AAGACACTAA TG CCATT ACT 414240 

GTAATAAAAG CAGGATAAAT TCGTTTAATT CGTCGAGTAT AAAATTGTTT AAGCGAAAAA 414 3 00 

GAATTCTGTT GAATTTCTGT GATAATAATA CCCGTAATCA AAAATCCAGA AATAACAAAG 41436 0 
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AAAATATCCA CCCCAAGAAA ACCACCAGAT AACCAGTTTT CATTAAGATG ATAAATAATG 414420 

ACTGAAATAA CAGCAATTGC GCGTaGTCCA TCAATTTCTG GACGTATTTA ATAGATGACA 414480 

TAATATTCCT TTTATAAAAT ATCGACCGCA CCTTGATTTA TGCGTCAAAA GTGCGGTGAT 414540 

TTTTTATGTG TTTTTATACG TTGAATCGGA AATGCATTAC ATCGCCATCT TGTACGATAT 414600 

AATCTTTGCC TTCTAAACGC CATTTACCTG CTTCTTTTGC ACCGTTTTCA CCGTTGAATT 41466 0 

GGATAAAGTC TTCATAAGCG ATAACTTCTG CTCGGATAAA GCCTTTTTCA AAGTCAGTGT 414720 

GGATTACAGC TGCTGCTTTT GGTGCGGTTG CGCCGACTGA AACTGTCCAA GCTCGTACTT 414780 

CTTTTACGCC AGCAGTGAAA TAAG TTTGAA GATTTAAAAG TG CGTAACCA GCACGAATTA 414 84 0 

CTCGGTTCAA TCCTGGTTCT TCAATGCCTA AATCTTGTAA AAACTCGACT TTTTCTTCAT 414 900 

CATCAAGTTC TGCAATTTCA GATTCAATCG CCGCACATAC TGGAACAACT ACCGCCCCTT 414 960 

CTTTTGCAGC AATTTCACGA ACGCGATCTA AATATGGGTT ATTTTCAAAG CCATCTTCAT 415020 

TTACGTTTGC GATATACATT GTTGGTTTTA ACGTTAAGAA GTTGTAACTC TTGATTGCTT 415080 

GCAATTCTTC TTTATCTAAC CCAACAGAAC GAATCATTCC TGCATTTTCC AGCACAGGAA 415140 

GGATTTTTTC CATAACGGAA AGTTCAAATT TAGCTTCTTT ATCGCCACCT TTTGCACGTT 4152 00 

TTTGTAAACG TTGGATTGCT CGTTCACAAC TGTCTAAGTC TGCTAAGGCT AATTCAGTGT 41526 0 

TGATAGTGTC AATATCATCA AGTGGGTCAA TTTTCCCTGC AACGTGTACA ATGTCATCGT 415320 

TTTCAAAACA ACGCACTACG TGACCAATTG CATCGGTTTC ACGAATATTT GCTAAGAATT 4153 80 

TATTACCCAA GCCTTCGCCT TTGCTAGCAC CTGCAACTAA GCCTGCGATG TCCACGAATT 415440 

CCATCGTAGT AGGTAATATG CGTTCAGGCT TAACGATTTC TGCCAATGCG TCTAAACGTG 415500 

GATCTGGCAT TGGCACGACA CCCGTATTTG GTTCGATAGT ACAGAATGGA TAGTTTGCGG 415560 

CTTCAATGCC AGCTTTAGTG AGTGCGTTAA AAAGAGTAGA TTTGCCGACA TTTGGCAATC 415620 

CCACAATACC ACATTTGAAT C C CAT AATTT TTCCCTTATT TTGAGTAAAA CGGTTGTAAA 415680 

ACT AAC C G CA C TTTTAAATT TTAAAACTGT TTAAACGATT GGTCG CTTTG AC CATT C CAT 41574 0 

CTCTGAAAAT CATTTCTACA CAATCAGTGG CTTCATCTAG CACTTTTTCA AGTGCATCGC 415800 

GATCTGCTGG AGATGGTTTA TTTAAGACAT AACCTGCCAC TAAATCACGA TGGCCAGGAT 415860 

GCCCAATGCC AATACGTAAA CGGTAGAAAT TATTGTTATT GCCTAATTG A GCCACAATAT 41592 0 

CTTTTAAGCC GTTATGCCCA CCATGTCCAC CGCCTTGTTT GAGTTTCGCA GTGCCTGGTG 415980 

GTAAATCTAA TTCATCATGA ATCACTAAAA TTTCTTCAGG TTTAATACGG TAAAAACTTG 41604 0 
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CCAATGCGCC AACTGCTTTA CCGCTTAAAT TCATAAAAGT TGTTGGCACT AAAAGGCGTA 416100 

CTTCTTTTCC ATTCACTAAA GTGCGGGCAG TTTTTCCGAA GAATTTACTT TCTGGATTTA 416160 

GTGAAACGTT AAAACGACGA GCCAAGCGTT CAATAAGCCA TTCGCCTGCA TTGTGGCGTG 41622 0 

TATCTGTATA TTTATCGCCA GGGTTGCCTA AACCAACAAT AAGTTTTATT TCTGACATTT 416280 

TTATTTGATG ATGTAAAAAA TAGTGGCGTA TTGTAGCGAA AAAGCCTTAA ATTC TCAATC 41634 0 

TATAAGGCGG AAACAAAGAG GTAATTTATT GCGCGTGATT TATCATTAAA AGTCACCGCA 416400 

CTTTTCCGTT ATAATGGCTA AAATTTTTTA ACAAAGAAGT AAAAATGAAT CAGATTAACA 41646 0 

TTGAAATCGC TTATGCTTTT CCCGAACGTT ATTATTTGAA ATCTTTTCAA GTTGATGAAG 41652 0 

GAATTACCGT GCAAACGGCG ATTACGCAAT CAGGGATTTT AAGCCAATTT CCAGAGATTG 41658 0 

ATTTG AG C AC AAATAAAATT GGGATTTTCA GTCGTCCAAT AAAATTGACT GATGTATTAA 41664 0 

AAGAGGGCGA TCGTATTGAA ATTTATCGTC CATTACTTGC AGATC CG AAA GAAATTCGCC 416700 

GTAAGCGTGC AGCAGAGCAA GCTGCAGCCA AAGACAAAGA AAAAGGAGCT TAAAAATGAC 416760 

CGCACTTTCA TCTGTTGATT TTTGTTTACC TGAACATATC ACGCCAGAAA TTTTTCTACG 416820 

TGATTATTGG CAGAAAAAGC CACTTGTGAT TCGGAACGGT TTGCCTGAAA TCGTTGGGCA 416880 

GTTTGAGCCA CAAGATATTA TTGAGCTCGC GCAAAATGAA GATGTGACAG CTCGTTTAGT 41694 0 

GAAGACATTT TCTGATGATG ATTGGAAAGT TTTTTTTAGT CCTTTGAGCG AAAAGGATTT 417000 

CCAAAAATTG CCAGAAAAAT GGTCGGTGTT AGTTCAAAAT TTGGAGCAAT GGTCGCCTGA 417060 

ATTAGGACAA CTTTGGAACA AATTTGGCTT TATTCCTCAA TGGCAACGTG ACGACATTAT 41712 0 

GGTGTCTTAC GCACCGAAAG GAGGTTCTGT CGGTAAGCAT TATGATGAAT ATGATGTATT 417180 

TTTAGTGCAA GGCTATGGTC ATCGTCGTTG GCAAGTGGGT AAATGGTGTG ATG CAAGCAC 417240 

TGAATTTAAA CCCAATCAAT CGATCCGAAT TTTTGACGAT ATGGG CGAAT TAGTCATTGA 417300 

TGAAGTGATG AATCCTGGGG ATATTTTGTA TATTCCAGCC CGTATGGCGC ATTATGGCGT 417360 

GGCTGAAGAT GATTGTTTAA CCTTTTCTTT TGGCTTGCGT TATCCAAATT TAAGCAATTT 417420 

GATTGATGGC ATAAGCAAAG GTTTTTGTCA TCAAGATCCT GATTTGAATT TAAGCGAGTT 4174 8 0 

TGATTTACCG CTACGTTTAA GCCAGTCTGA ACAACGTACA GGTAAACTTG CAGATGAAAA 41754 0 

TATTCAAGCG ATGAAACAGC TTTTATTAGA TAAATTGGCT CATTCAGAAG CTTTTGATAC 417600 

GTTATTTAAA CAAGCAGTGG CGAGTGCGGT AAGTTCTCGT CGTTATGAGT TATTGGTTTC 417660 

TGATGAAATG TGCGATCCTG ATGAAGTGCG GTCAATTTTA GAAGAAGATG GCGCATTCCT 417720 
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TTCTCAAGAC AACAATTGTA AATTGCTTTA TACCGAAAAT CCGCTGCGAA TTTATGCGAA 417780 

TGGCGAGTGG TTAGATGAAC TTAATATAAT TGAGAGCGAA GTATTGAAAC GTTTATCTGA 417840 

TGGCGAAAGT TTAGATTGGG CGTTTTTATC TGATTTAGCC AATAAAACAG AAGATC CTGA 417900 

AACCTCAATG GATTTGTTGT TGGATT CCAT TTGTAACTGG GTTGATGATG GGTGGGCGTT 417960 

AATTGAATAA TATGTCAGAC AATATTTATT CCGTCTCTCA ACTGAATTCT GCCGCGCGTC 418020 

AGATGCTTGA GGGTAATTTT TGCCAAATTT GGCTAACGGG GGAAATTTCC AATTTTACCC 418080 

AACCTGTGTC TGGACATTGG TATTTAACGC TGAAAGATGA AAATGCACAG GTGCGTTGTG 418140 

CAATGTTTCG TATGAAAAAT TTGCGTGTAG CTTTTCGTCC AC AAAAT GG A ATGCAAGTGC 418200 

TTGTGCGTGC CAATGTGAGT TTATATGAGC CTCGCGGCGA TTATCAG CTT ATTATTGATT 418260 

CTATGCACCC AG CAGGCGAG GGATTATTGC AACAGCAATT TGAAGCGTTA AAAATGAAGT 418320 

TGG CTGCGGA AGGATTGTTT GCTCAAAATC TTAAGAAAAC TCTACCGCAC TTTAGTAAAG 4183 80 

CAGTGGGGAT TATAACTTCT TCTACAGGTG CGGCATTGCA AGATATTTTG CATATTCTGG 41844 0 

CACGCCGTGA T C CAAGTTT A AAAGTCGTCA TTTATCCCAC TGCGGTTCAA GGCAAAGAGG 418 50 0 

CAACTG GAGA AATTGTG CAA AT G ATTGAAC TTGCTAATGC GCGACAAGAA GTAGATGTAT 418 56 0 

TAATTGTGGG GCGTGGTGGT GGTTCATTAG AAGAT CTTTG GTGTTTTAAT GAAGAAGAAG 41862 0 

TGGCGCGAGC GATTTTTCGT TCAACCTTAC CTATTATCAG TGCGGTGGGA CACGAAACAG 4186 8 0 

ATGTGACGAT TGCCGATTTC GTTGCAGATC TTCGTGCGCC AACCCCCTCT GCTGCGGCTG 41874 0 

AGTTAGTTAG TCGTAATCAA GATGAATTAT TGCAACAACT TCGCCATCAA CAGCAACGTT 4188 00 

TAGATATGGC ATTTGATCGT TTATTTACGC GAAAAAGCCA G CGTTTG AAA CAGCTTGCCT 418860 

T ACGATTG CA AAATCAACAT CCACAGAATC AGCTACGTGC TCAACAAGCG AAAAATGAAC 418 92 0 

AGCTTACACA TCGCTTGCAA CTTGCAATAT TGCGTCAGTT TGAAAATACA CAACAAAAAT 418 98 0 

TCACCGCACT TTCTGTTCGT TTAAAACAAA ATCCATTGCC ATATCGCATA CAACGTTATC 41904 0 

AACAAGGCTT AGAGCAGTTA AAAGTGCGGT TAAATTTTTG CGTAAATCGC CAAGTGACGG 419100 

AACGTCAAAA TAAACTGGCA ACATTATGCG GAAAACTGGA TGGCTTAAGC CCATTAAAAG 419160 

TCTTGGCGAG AGGTTATTCC ATTGCGGAAA ATCCGCAAGG TAAAGCCATT GTTAG CGTGA 419220 

AAGATGTCAA TCAAGGGGAC TTCATTACAA CTCAAGTGGC TGATGGAAAA ATTGTGAGTA 419280 

AGGTTTTGTA AGATG CTT AT TTTGTAAAAA TGGAGTGAAA TACTGTTTCA CTTACTTCTA 41934 0 

AAAAAGAATG CTATGACTGT TGAATAAATA ATCATGGCAT TTTGTTTTTT TGTGACTCAC 419400 
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TTCAAACTCT CAACAAAAAG TGGAAACCTT CTATTTTTTT TGTAGTCTTT TAAGCGTATG 419460 
ATTTTGCGTG CAGTTTTGGA GAAAACAGAT GTCAGAAATT CAACATTTTA GCCAACAAGA 419520 
TATAGAAATA TTGGGCGAAC AAACTTTGTA TGAAGGTTTT TTCACGCTTA AACGAATCCA 41 958 0 
ATTTAAACAT AAACTTTTTG CTGGGGGACA AAGTGGCGTC GTAACCCGAG AATTACTGAT 4 1 9S40 
TAAAGGTGCC GCCTCAGCGG TTATCGCTTA TGATCCAAAA GAAGATTCGG TGATTTTAGT 419700 
CGAACAAGTT CGTATTGGCG CGGCTTATCA TCCTGAATCT CATCGTTCGC CTTGGTTACT 419760 
TGAACTCATT GCAGGTATGG TGGAAAAAGG CGAAAAGCCA GAAGACGTTG CTTTGCGTGA 419820 
GAGTGAAGAA GAAGCAGGGA TTCAAGTCAA AAATTTAACA CATTGTTTAA GTGTGTGGGA 419880 
TAGCCCAGGT GGCATAGTGG AGCGGATTCA TTTATTTGCA GGCGAAGTGG ATAGCGCACA 419940 
AGCAAAAGGC ATTCACGGTT TGGCTGAAGA AAATGAAGAC ATTAAAGTGC ACGTAGTAAA 420000 
ACGTGAACAG GCGTATCAAT GGATGTGCGA GGGGAAAATT GATAATGGCA TTGCAGTGAT 420060 
TGGATTGCAG TGGCTTCAAT TAAATTATGC TCAGTTACAA CAAAGTTGGA AACGTAGCTA 420120 
GGGGGCATTA ATGAAAAATA CGTTTGTTTA TCAGGCGGAA AAGCCTGTTA TAAAGTTATT 420180 
ACAAATTACC GATCCCCATT TATTTAAAGA TGAAAGTgCG GAATTATTGG GGGTAAATAC 420240 
ACAAGCAAGT TTTGCTCAAG TATTAAAGGA AATTCAGCAA GAAAATAACG AATTTrATGT 420300 
AATTCTTGCT ACAGGCGACC TTGTGCAGGA TAGTAGCGAT GAAGGTTACA TTCGTTTTGT 420360 
AGAAATGATG AAACCTTTCA ATAAACCTGT GTTTTGGATT CCAGGAAATC ACGACTTTCA 420420 
GCCTAAAATG GCAGAATTTT TAAATCAGCC ACCCATGAAC GCAGCAAAAC ATCTTCTATT 420480 
GGGCGAGCAT TGGCAAGCCT TATTGTTGGA TAGCCAAGTT TACGGTGTAC CACATGGGCA 420540 
ACTAAGCCAG CATCAACTCG ATTTATTAAA AGAAACCTTA GGAAAAAATC CTGAACGCTA 420600 
TACCTTGGTT GTATTACACC ATCATTTATT ACCAACTAAT TCCGCGTGGC TTGATCAACA 420660 
TAACCTTCGT AATTCTCACG AATTGGCAGA AGTGCTTGCG CCTTTTACCA ATGTGAAAGC 420720 
TATTTTATAT GGGCATATTC ATCAAGAAGT AAACAGTGAA TGGAATGGTT ATCAAGTGAT 420780 
GGCAACGCCA GCGACTTGTA TTCAATTTAA ACCTGATTGC CAGTATTTTT CTCTTGATAC 420840 
CTTACAGCCT GGTTGGCGTG AAATTGAGCT ACATTCAGAC GGTTCAATTC GTACGGAAGT 420900 
AAAGCGTATT CAGCAAGCGG AATTTTTCCC GAATATGCAA GAAGAAGGGT ATTAAATTTT 420960 
GTGGACGTTT TAAAAATCAC TTTCATAGTA AGCATTTTTA AAACGTCCAC TATTCTAACC 421020 
CAATAAAAGT GCGGTCGAGT TTCAGATTAT TTTTAACGAT TCGCCAGTTC AAAAATCATC 421080 
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GCTTCTGCTT GGCAACTAAA AGTGAAAGAA GCATTTAATT GAACGCCGTT TTCAACTGCC 421140 
ACGATTTCGC TTTTAATTTC GCAAGGATCG CTTTCTGTAT TGCGCGCTTT CTTTGTTAAT 421200 
CGTTCAAGCA CTGCATTGGC ATCAGCTTCA GTGCTATAAA ATTGATGTAC TTCTACG CT A 421260 
CAGTCAGAGC CATCAATAAT AGTGCCTACA TCAACACAGC TACAAGTTAA GGATTCTTCT 421320 
GCTTTGCATT TTACGGTCAT TTTCTTTnCC TTATATGAAT ATTTTTGTGC ATTTTATCAT 421380 
TTTTCTTTTT CCCCATAAAT GACAAAAATT CAACAAAACA AT C C ATT AAA TGCAAAAAGC 421440 
TCCTGCTAAC TAGTCGTAGC TCTAATAAAA ATAAGGAATA GAGTTGCACT CAATAGATTC 421500 
GGGCTTTATA ATTGCCCAGA TTTTTATTTA TAACAAAGGG TTCCAAATGA AAAAATTTAA 421560 
TCAATCTCTA TTAGCAACAG CAATGTTGTT GGCTGCAGGT GGTGCAAATG CGGCGGCTTT 421620 
CCAATTGGCG GAAGTTTCTA CTTCTGGGCT TGGTCGTGCC TATGCAGGTG AAGCGGCGAT 421680 
TGCAGATAAT GCTTCTGTCG TGGCAACTAA CCCAGCTTTG ATGAGTTTAT TTAAAACGGC 421740 
ACAGTTTTCC ACGGGTGGCG TTTATGTTGA TTCTAGAATT AATATGAATG GTGATGTAAC 421800 
TTCTCATGCT ACAATAATAA CATCAAGTAG TGGAATTAAA GCAATAGAGG GCGGCTCAGC 42186 0 
TTCAGCGCGT AATGTTGTTC CTGGTGCTTT TGTGCCAAAT CTTTATTTCG TTGCAC CAGT 421920 
GAATGATAAA TTCGCGCTGG GTGCAGGAAT GAATGTCAAT TTCGGTCTAA AAAGTGAATA 421980 
TGACGATAGT TATGATGCTG GTATATTTGG TGGAAAAACT GACTTGAGTG CTATCAACTT 422040 
AAATTTAAGT GGTGCTTATC GAGTAACAGA AGGCTTGAGC CTAGGTTTAG GGGTAAATGC 422100 
AGTTTATGCT AAAGCCCAAG TTGAACGGAA TG CTGGT ATT ATTGCGGATA GTGTTAAGGA 422160 
TAACCAAGTA AAAACTGCAC TCACAGTACA GCAAGAACCA CTCAAATTTC TTGATAAGTA 422220 
TTTGCCCTCT AAAGACACAT CTGTTGTGTC ATTACAAGAT AGAGCAGCTT GGGGCTTTGG 422280 
CTGGAATGCA GGTGTAATGT ATCAATTTAA TGAAGCTAAC AGAATTGGTT TAGCCTATCA 422340 
TTCTAAAGTG GACATTGATT TTnCTGACCG CACTGCCACT AGTGTAGAAG CAAATGTTAT 422400 
CAAGGCAGGT AAAAAAGGTG ATTTAACCCT TACATTGCCA GATTACTTAG AACTTTCTGG 422460 
TTTCCATCAA TTAACTGACA AACTTGCTGT GCATTATAGT TATAAATATA CCCATTGGAG 422520 
TCGTTTAACA AAATTAAACG CTAGCTTCGA AGATGGTAAA AAAGCTTTTG ATAAAGAATT 422580 
ACAATACAGT AATAACTCTC GTGTTGCATT AGGGGCAAGT TATAATCTTG ATGAAAAATT 422640 
GACCTTACGT GCGGGTATTG CTTACGATCA GGCTGCATCT CGTCATCAGC GTAGTGCTGC 422700 
AATTCCAGAT ACCGATCGCA CTTGGTATAG TTTAGGGGCA ACCTATAAAT TCACGCCGAA 422760 
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TTTATCTGTT GATCTTGGCT ATGCTTACTT AAAAGGCAAA AAAGTTCACT TTAAAGAAGT 4 22820 
AAAAACAATA GGTGACGAAC GTTCACTTAC ATTGAATACA ACTGCAAATT ATACTTCTCA 422880 
AGCACATGCA AATCTTTACG GTTTAAACTT AAATTATAGT TTCTAATCCG TTAAAAAATT 422940 
TAGCATAATA AAGCACACTT CCACACCAAG TGTGCTTTTC TTTTTATAAA ACAAGGCGAA 423000 
AAATGACCGC ACTTTATTAC ACTTATTACC CCTCGCCAGT CGGACGGCTT TTGATTTTAT 423060 
CTGACGGCGA GAGCATTACA CATATTGATT TTGAAAAAGA GCAATATGCG CCTAATCCAA 423120 
AATGGCACAA GCAAGATGAA TTGCCTGTTT TTCAAAAAGT GCGGTTAGCT TTTGAACGAT 423180 
ATTTTAATGG CGAAGTTGAG TGTTTTTCAA ACATTCCCTT AAAACCAGAA GGCACCGCAT 423240 
TTCAACAAGC CATTTGGCAA GCCTTGCGAG AAATTGATTA TGGCGAGCTT TCAACTTATG 4233 00 
GAGAGCTGGC GTTACGGATT AATAATCCTA AAGCTGTTCG TGCCGTGGGA GGCGCGGTAG 423360 
GGAGTAATCC GATTAGTATT ATTATTCCTT GTCACCGCAT TTTAGGCAAA GATCGTACTT 42342 0 
TAACTGGTTT TGGTGGTGGC TTAGAGGCAA AACGTTTTTT ATTGCAGTTA GAGAAAATTC 42348 0 
CCTATATTGA TAAAGGCACT GAAAACACGA AGCCTCGCTT TTTTAAGAAA TATCACGAAT 423 540 
GATTCCACAA ACCCTTGAAC AATTACTTTC CCAAGCGCAA TCAATCGCTG GCTTAACTTT 423600 
TGGGGAGTTA GCTGATGAAC TGCATATTCC TGTCCCGATT GATTTAAAAC GTGATAAAGG 423660 
CTGGGTGGGG ATGTTATTAG AGCGCGCTTT AGGCGCAACG GCAGGGAGTA AAGCTGAACA 423 720 
GGATTTTTCT CATTTAGGTG TTGAACTTAA AACATTG CCG ATTAATGCGG AGGGTTATCC 423780 
TCTTGAAACC ACCTTTGTGA GCCTTGCCCC GTTGGTACAA AATTCAGGGG TTAAATGGGA 4 23 84 0 
AAATTCCCAC GTTCGTCATA AACTTTCTTG TGTACTTTGG ATGC CCATTG AAGGCAGCCG 423900 
CCATATTCCA TTACGTGAAC GCCATATCGG CGCGCCAATT TTCTGGAAGC CAACCGCTGA 423 960 
GCAAGAACGT CAATTAAAAC AAGATTGGGA AGAATTGATG GATTTAATTG TACTTGGCAA 42402 0 
ATTAGATCAA ATTACCGCTC GAATTGGTGA AGTTATGCAA CTTCGCCCAA AGGGTGCAAA 424080 
TAGTCGCGCT GTGACAAAAG GCATAGGAAA AAATGGCGAA ATTATAGATA CCTTGCCTTT 4 2414 0 
AGGTTTTTAT CTAAGAAAAG AATTCACCGC TCAAATTCTG AACGCATTTT TAGAAACAAA 424200 
GTCG CTTTAA CTTGATTTTA AAACACGAAA AAAACCGACC GCACTTTTTA GCTCGTTGCC 424260 
ATAAAAAATT AAGGGGATTC GATTTCGTTC CCAAGGTGGC ACGCCTAATT CTTGCCAGAT 4 24320 
i H i i-i-l ATA TCTTCTCTTG GGCGTTTCGG ATAATGTTTC ACTTTGCCCG AATAGCCAAA 4 24380 
ACGAATAGAA ATAGGCAAGT TCGTTTTTTC CAGCGTAACA GAGTAATCCT GCCAGTAGAA 424440 
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AATCAGATTA TGTTCATTTT CTTGAACTGT GAGATTGCCT AAATCATCGG GTAAATTTAG 424 500 
CGTATTTTGT TCTAATTTTA GGGTG CATTT GGTTAGATCA GAAAACGGCT TAGTTAAATA 424S6 0 
TAAACTATCT TGATAACGTC GAATAACTTT ATTTACAAGC TGAAATTGAG GATTTG CCTC 424620 
TTCTTTTGCA AAAATGACAT CATTAATAAG CTGTGTAAGC TGGCGTTTAC TTGGCATTTC 424680 
AAGCTGGTTT TCGGCAAGCC ACATACGCAA AAGTG CGGTC TGTTTTGCGA GAGAATATTG 424740 
GCGGAACTGG CACAGTTTAA ATTGGTTTTT AATTTGACAA TGTTCTGTAA AAATTTCGCT 424800 
CAATAAATCA TTAATTAGCT GTTGTTGCTC AAAACAATGT TGTGCTGAAC GTTGCACGGC 424860 
TAAATCAAAA TGAGCCCAGC GTTCACGTAA TTCTGGCAAA ATTTCATTGC GTAAGAAATT 424920 
ACGATCGTAT CGATTATCTT CATTACTTTC GTCTGTAATC CAGTTGAGTT TTTCTTTTTG 424980 
CGCGTAATTT TCTAACTGGG TGCGAGTAAA GCCTAACAAA GGGCGTAAAA TTGGCATGCC 425040 
GAATAATACA CTTTGTTGTT GCATTGCGCC TAAGCCTTTT AGCCCACTGC CACGTTTTAG 425100 
GGCTAAGAAA AAGGTTTCAG TTTGATCGTT TAGATGATGC GCTGTGACCA G CATTTCTTG 425160 
TGTTTGAAGA TATTTTTTGA TGGCTTGATA GCGTGCTTCA CGTGCGCCAG CCTCAATACC 425220 
ATTAGTTTTA TCTACTTGTA CTCGCTCAAT AATCAATGGA ATTTGAAATT GATCG CATAA 425280 
GTCTTGACAA TGTTTAGCCC AACTGTCGGC ATTGGGGCTT AATCCGTGAT GAATATGGAT 425340 
TGCTCGTATT GATAAAGGCG GTAAGTGCGG TTGTTTTTGA CAAAGTTTTG CAAATAAAGA 425400 
AAGAAGTACA GTGGAATCTA AACCACCACT CAAGGCAATA AGAAATGCTT GAGCGGTGGC 42 5460 
TTTTTGGAGT TGTTTTTCAA TGTCTGAAAG TAAATCCATT AGGCAACTTT TTCAGCTTTA 425520 
TAATTGCTGC TTGGATTAAG AATTTCAAAA CGTGCAGCAA TCGTTTGTAA TTCGTAAGAA 425580 
CCCAGTTCAA AGGTGGTTTT CATTACGCCT GCCACTTCAT TATTCATTGC TTCAAAAGCT 42 564 0 
TCAACATCAG ATTTTCCATT GAGCAAATTC GCTAAGAATG TACCTGCGAT TAAATCGCCC 425700 
ACGCCCACTG GTTCAAAGTT GAATTGATAA AGTGGACGGC TTAAATGCCA AACACCCTCT 425760 
GGGGTTGCCA TAATAATTTC AAATGTATCA GGATCGTTGA TTTT AC C CG C ACTGCCTAAA 425820 
TGTTTAACGA GAACTTTCTT CACGCCTTTT GCAATCAACG CATTCACTGC TTTTAATACA 425880 
TCGTCAAAGG TATTAATTGG GAATTCGGTT AGCTGACGCA ATTCGTGTAA GTTTGGCGTC 425940 
ATAATATCCG CCACTGGAAT GGCTTTTTCA ATAAGTGCTT CGCGAACGCC ATTGGCAACT 426 000 
ACGCAGATTT TTTTCGGATG TGGCATTACA GGGTCGCATA AATAAAGCGC ATTGGGATTA 426060 
CGCAGTTTGA TTTGTTCTAA GGCAAAAAGA ATTTGATCGA CTTGTTCCGC AGATCCTAAG 426120 
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TAACCTGATA 


GTAAGGCATC 


ACATTCTTGT 


AGTTTTTCAA 


TATTATCAAG 


TCCAGTTACA 


426180 


ATTTCTCGGA 


TTTGTTCTTG 


TGGAATGACC 


ATACCTGTCC 


ATTTTCCGTA 


TTGTGTGTGA 


426240 


TTAGAAAATT 


GAACGG TATT 


TAATG C C CAT 


ACATCTACGC 


CTAACAACTG 


CATTGGAAAA 


426300 


GTCGCGGATT 


TGTTACCTGC 


AAAACCATAG 


ACTACGTGGG 


ATTGGATCGA 


AAGTACGTTT 


426360 


TTCATTATAT 


TTTCTCCTTT 


GTTGTATATT 


TAGATCAAAA 


AAGAGCGGTA 


AAAATCACCG 


426420 


CTCTTTTAGA 


ATGTTAGTTA 


GCAATAACCG 


TAAGACATTA 


AACGTTCATA 


GCGTCTATTT 


426480 


AATAATTCTT 


CTTTG CTT AG 


TTCATCGAGT 


TCAGCTAAAT 


CTTCTTTTAA 


ACGAAGTTTT 


426540 


AAGTTTTCAG 


CTATTTCTAG 


ATAACTACGA 


TGTGCACCAC 


CTAAAGGTTC 


TTGCACAATG 


426600 


CTGTCGATGA 


GATTTAATTC 


TTTTAAACGG 


CTTGCGGTTA 


ATCCCATTAC 


TTCTGCTGCT 


426660 


GTTGATGCCT 


TTTCCGCACT 


TTTCCAAAGA 


ATAGAAG CGC 


AGCCTTCAGG 


CGAGATCACT 


426720 


GAATAAGTGG 


AATACTGCAA 


CATATTAACT 


TTATCGCCCA 


CGCCAATAGC 


TAACGCACCG 


426780 


CCCGAACCGC 


CTTCGCCGAT 


AACGGTACAA 


ATAACAGGAA 


CGGTAAGCTG 


TGCCATTTCG 


426840 


CGTAAGTTGC 


GAGCGATAGC 


TTCCGCTTGA 


CCACGTTCTT 


CTGCGCCGAT 


ACCAGGATAA 


426900 


GCTCCTGGTG 


TGTCAATAAA 


TGTGATAATT 


GGCAATTTAA 


AACGTTCAGC 


CATTTCCATT 


426960 


AAACGTAACG 


CTTTACGATA 


GCCTTCAGGT 


GCTGGCATAC 


CAAAATTGCG 


TTGTACTTTT 


427020 


TCTTTTACAG 


AACGACCTTT 


TTGATGACCG 


ATAACCATCA 


CTGGACGACC 


ATCTAAACGC 


427080 


GCCAAGCCAC 


CAACAATCGC 


TTTATCATCA 


GCAAAAGCGC 


GATCCCCTGC 


AAGTTCTTCA 


427140 


AATTCCGTAA 


AAATATGTTC 


GATATAATCT 


AATGTATAGG 


GGCGGTTTGG 


ATGACGCGCC 


427200 


ATACGTGAAA 


CTTGCCAAGC 


ATCGAGGTTG 


GCAAAGGTTT 


TTTTAGTGAG 


TTCGTTACTT 


427260 


TTCTTTTGTA 


AGCGTTTGAT 


TTCATCAGTT 


AAATCAACTT 


TATCATCTGA 


TGCTGCTCGT 


427320 


AATGCTTCAA 


TTTTTGCTTC 


TAATTCAGCA 


ATCGGCAATT 


CAAAATCTAA 


ATATTCTTGG 


427380 


TTCATTTTGT 


TATCCTAAAA 


AGACCGTAAA 


AATCGACCGC 


ACTTTATCAA 


GGAATTAAAC 


427440 


TGTGGCGCAA 


ATGGTTTTCA 


CTGGTTGGGC 


AAGCTTATAA 


TCGCTCTCTT 


TTAAAGAGCG 


427500 


ATAAGACAAA 


TAAAAACGCG 


GAACAAATCA 


CAACAGAAGG 


CCCAGCAGCA 


GTATCATAAA 


427560 


AGGCTGATAA 


GATTAGTCCA 


CCTATAATCG 


ATAACATACT 


C AT AAT AAT C 
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CCATAGATTC 


TGGTGTTCTT 


GCAAAGCGAC 


GAGCGGTTGC 


CGCAGGGATT 


ATTAATAAGG 


427680 


ATGTGATAAT 


TAATG CACCG 


ACAAATTTCA 


TACTCAAGGC 


AATAGTCAAT 


GCGGTTAATA 


427740 


TCATTAAAAT 


AAAACGCATT 


TTTTTGATAT 


TAATACCTTC 


AACTTGCGCG 


AGTTCTGGCG 


427800 
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ATACGGTGGT GGAAAGTAAG GATTGCCAAA AGTAAATTAA TGTTGAAAGC ACAATTATCA 4 2 786 0 

CGCCAATGCC GATATAAATT AAATCCGTAT AATTAATCGC GAGTAAATCC CCAAATAAAT 427 920 

AATTCATTAA ATCTACCCGT ACATTCCGTA ATAATCCTAC TGTCACAACC C CGAGAG ATA 4 27980 

AACAGCTATG GGCGATAATA CCGAGCAAGG TATCAATAGA AAATTGCGTA TTG CTTTCTA 4 28040 

ACCACACCAT TGCGATAGCA AGAATTAAGG TAAGCACCAC AATCGCAATG TAAGGATTGA 428100 

CTTGTAAGAA AATACCTAAT GCTACACCGA G CAAGG C AG A ATGAGAAAGG GTGTCGCCAA 42816 0 

AAT AAG C CAT TTTACGCCAC AC CACAAAT A CGCCAAGTGG CGCGGTAATA AGCGAAAGCA 428220 

CAATGCCTGT TAAAAGGGCA GGAAATAAAA TTTCAAACAT TATTTATCCT TATTTTGACA 42 8280 

ATGCACTGCT GATGAATTAC AACCGCACAC ATCTCCGTGT AAAGTGTGAT GATGATTGTG 4 28340 

ATGATGGGTA TAAAAGCCGA CATTTTGTGC GATCTGATTT CCCCATAAAC G CAT AAATGT 42 8400 

TGGATCATTT GAGAGAACAT CAGGCGTACC TGCACAACAA ATATGTTGGT TAATACACAA 4 2 8460 

TACTTCTTTA CTATCCGCCA TCACAATGTG TAAATCG TGT GATACCATCA ATACAGCACA 428 520 

ATTTAATTTT TGTTGGGTCT GATGAATGAG CTGATAAAGT TCTGCCTGAC CAGTAATATC 428 580 

TACGCCTTGA GTGGGTTCAT C TAG CACAAG TAAATTTGGT TTATTTAAAA TTGCACGCGC 428640 

CAATAATACG CGTTGCATTT CGCCACCAGA TAGTTTTTGC ATATTATTTT TGCGTAAATG 428700 

TGAAATTGAA AGTTGCTCAA GTGCGGTGGA AATTTCTTGT GTTTTAATGC CTTTTTTTAG 428760 

CGAAAGAAAT CGCTCAACCG TAATAGGCAA ACT ATGATC C AAATGGATTT TTTGTGGCAC 42 882 0 

ATAGCCAATG CGAACATTTT TACTGTAAAT GACCTCGCCA GAAGTGGGCG TTTGTAGTTT 42 8 88 0 

TAATAAGGTT TTTAATAACG TAGATTTGCC ACCACCATTT GGCCCAACAA TAGTGATAAT 42 8 94 0 

AGAATTGGGA TAGATATTTA AATTAATATC TTGCAGAGCG GTTTTTTGTG CGAAAACCAC 42 9000 

ATTGATATTT TTTAATTGAA TGAGCGGTTG TTGGTTTTGC TCGTTGCGAA TGGCTGTGAT 4 2 906 0 

ATTCATAAAA TCGCCGTGTT TTCAACAATG GTTGGGGAAA ATAACGCATT TTTATAGAAT 42912 0 

TCTCAAGATT TTCTCTCTTA TGTCGAATAT TTAAGCGTTT GAATAATGCT TTTTCCATAG 42 918 0 

GAATGAATTT TCTTTATAAT CTGTAGTCTG AATGCTGAAA ACTTTTTTAT TTTAGGGATG 4 2 9240 

CCAGTGCAAC ACGTAAAATT AGCTCGAGAT AGACGAAAAA AACG CGCAT A TATAAAAGTA 4 2 9300 

GGCGTGTTTT TTGTTGCTAT TTTGCTTATT TTGACAGGCA TTTTACTGAC AATAAAAAGC 429360 

AAACCTGAAG AAAACTCCAT ATTTTCCACC TTTGACAGTG GGGAATATCA CGAGTTAAAT 4 29420 

ACATCGCCAA ATGAAAATTC CACCGCACTT CAACCTGATG AAAACGCCAC CTCTTATGAT 4 29480 
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